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(57) Abstract: The present 
invention provides a method for 
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recombinant gene encoding a 
protein comprising the sequence 
of a first chaperone protein, 
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encoding a protein comprising the 
sequence of a second chaperone 
protein and a third gene, such as a 
third recombinant gene, encoding 
a desired protein (such as a 
desired heterologous protein), 
wherein the first and second 
chaperone s are different; and (b) 
culturing the host cell in a culture 
medium to obtain expression of 
the first, second and third genes. 
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GENE EXPRESSION TECHNIQUE 
FIELD OF THE INVENTION 
5 The present application relates to gene expression techniques. 
BACKGROUND OF THE INVENTION 

The listing or discussion of a prior-published document in this specification 
10 should not necessarily be taken as an acknowledgement that the document is part 
of the state of the art or is common general knowledge. 

The class of proteins know as chaperones has been defined by Haiti (1996, 
Nature, 381, 571-580) as a protein that binds to and stabilises an otherwise 
15 unstable conformer of another protein and, by controlled binding and release, 
facilitates its correct fate in vivo, be it folding, oligomeric assembly, transport to a 
particular subcellular compartment, or disposal by degradation. 

BiP (also know as GRP78, Ig heavy chain binding protein and Kar2p in yeast) is 
20 an abundant ~70kDa chaperone of the hsp 70 family, resident in the endoplasmic 
reticulum (ER), which amongst other functions, serves to assist in transport in the 
secretory system and fold proteins. 

Protein disulphide isomerase (PDI) is a chaperone protein, resident in the ER that 
25 is involved in the catalysis of disulphide bond formation during the post- 
translational processing of proteins. 

Studies of the secretion of both native and foreign proteins have show that transit 
from the ER to the Golgi is the rate-limiting step. Evidence points to a transient 
30 association of the BiP with normal proteins and a more stable interaction with 
mutant or misfolded forms of a protein. As a result, BiP may play a dual role in 
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solubilising folding precursors and preventing the transport of unfolded and 
unassembled proteins. Robinson and Wittrup, 1995, Biotechnol Prog. 11, 171- 
177 3 have examined the effect of foreign protein secretion on BiP (Kar2p) and 
PDI protein levels in Saccharomyces cerevisiae and found that prolonged 
5 constitutive expression of foreign secreted proteins reduces soluble BiP and PDI 
to levels undetectable by Western analysis. The lowering of ER chaperone and 
foldase levels as a consequence of heterologous protein secretion has important 
implications for attempts to improve yeast expression/secretion systems. 

10 Expression of chaperones is regulated by a number of mechanisms, including the 
unfolded protein response (UPR). 

Using recombinant techniques, multiple PDI gene copies have been shown to 
increase PDI protein levels in a host cell (Farquhar et al, 1991, Gene, 108, 81-89). 

15 

Co-expression of the gene encoding PDI and a gene encoding a heterologous 
disulphide-bonded protein was first suggested in WO 93/25676, published on 23 
December 1993, as a means of increasing the production of the heterologous 
protein. WO 93/25676 reports that the recombinant expression of antistasin and 
20 tick anticoagulant protein can be increased by co-expression with PDI. 

This strategy has been exploited to increase the recombinant expression of other 
types of protein. 

25 Robinson et al, 1994, Bio/Technology, 12, 381-384 reported that a recombinant 
additional PDI gene copy in Saccharomyces cerevisiae could be used to increase 
the recombinant expression of human platelet derived growth factor (PDGF) B 
homodimer by ten-fold and Schizosacharomyces pombe acid phosphatase by four- 
fold. 

30 
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Hayano et al, 1995, FEES Letters, 377, 505-511 described the co-expression of 
human lysozyme and PD1 in yeast. Increases of around 30-60% in functional 
lysozyme production and secretion were observed. 

5 Shusta et al, 1998, Nature Biotechnology, 16, 773-777 reported that the 
recombinant expression of single-chain antibody fragments (scFv) in 
Saccharomyces cerevisiae could be increased by between 2-8 fold by over- 
expressing PDI in the host cell. 

10 Bao & Fukuhara, 2001, Gene, 272, 103-110 reported that the expression and 
secretion of recombinant human serum albumin (rHSA) in the yeast 
Kluyveromyces lactis could be increased by 15-fold or more by co-expression with 
an additional recombinant copy of the yeast PDI gene (KIPDI1). 

15 In order to produce co-transformed yeast comprising both a PDI gene and a gene 
for a heterologous protein, WO 93/25676 taught that the two genes could be 
chromosomally integrated; one could be chromosomally integrated and one 
present on a plasmid; each gene could be introduced on a different plasmid: or 
both genes could be introduced on the same plasmid. WO 93/25676 exemplified 

20 expression of antistasin from the plasmid pKH4oc2 in yeast strains having a 
chromosomally integrated additional copy of a PDI gene (Examples 16 and 17); 
expression of antistasin from the vector K991 with an additional PDI gene copy 
being present on a multicopy yeast shuttle vector named YEp24 (Botstein et al, 
1979, Gene, 8, 17-24) (Example 20); and expression of both the antistasin and the 

25 PDI genes from the yeast shuttle vector pCl/1 (Rosenberg et al, 1984, Nature, 
312, 77-80) under control of the GAL 10 and GAL1 promoters, respectively. 
Indeed, Robinson and Wittrup, 1995, op. cit., also used the GAL 1 -GAL 10 
intergenic region to express erythropoietin and concluded that production yeast 
strains for the secretion of heterologous proteins should be constructed using 

30 tightly repressible, inducible promoters, otherwise the negative effects of 

3 
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sustained secretion (i.e. lowered detectable BiP and PDI) would be dominant after 
the many generations of cell growth required to fill a large-scale fermenter. 

Subsequent work in the field has identified chromosomal integration of transgenes 
as the key to maximising recombinant protein production. 

Robinson el al, 1994, op. cit, obtained the observed increases in expression of 
PDGF and S. pombe acid phosphatase using an additional chromosomally 
integrated PDI gene copy. Robinson et al reported that attempts to use the multi- 
copy 2\xm expression vector to increase PDI protein levels had had a detrimental 
effect on heterologous protein secretion. 

Hayano et al, 1995, op. ext. described the introduction of genes for human 
lysozyme and PDI into a yeast host each on a separate linearised integration 
vector, thereby to bring about chromosomal integration. 

Shusta et al, 1998, op. ciL, reported that in yeast systems, the choice between 
integration of a transgene into the host chromosome versus the use of episomal 
expression vectors can greatly affect secretion and, with reference to Parekh & 
Wittrup, 1997, Biotechnol Prog., 13, 117-122, that stable integration of the scFv 
gene into the host chromosome using a 5 integration vector was superior to the use 
of a 2^m-based expression plasmid. Parekh & Wittrup, op. ext., had previously 
taught that the expression of bovine pancreatic trypsin inhibitor (BPTI) was 
increased by an order of magnitude using a 5 integration vector rather than a 2pm- 
based expression plasmid. The 2|am-based expression plasmid was said to be 
counter-productive for the production of heterologous secreted protein. 

Bao & Fukuhara, 2001, op. ext., reported that "It was first thought that the KIPDI1 
gene might be directly introduced into the multi-copy vector that carried the rHSA 
expression cassette. However, such constructs were found to severely affect yeast 
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growth and plasmid stability. This confirmed our previous finding that the 
KIPDI1 gene on a multi-copy vector was detrimental to growth of A: lactis cells 
(Bao et al 2000)". Bao et al, 2000, Yeast, 16, 329-341, as referred to in the 
above-quoted passage of Bao & Fukuhara, reported that the KIPDI1 gene had 
5 been introduced into K lactis on a multi-copy plasmid, pKan707, and that the 
presence of the plasmid caused the strain to grow poorly. Bao et al concluded that 
over-expression of the KIPDU gene was toxic to K. lactis cells. In the light of the 
earlier findings in Bao et al, Bao & Fukuhara chose to introduce a single 
duplication of KIPDI1 on the host chromosome. 

10 

Against this background, we had previously surprisingly demonstrated that, 
contrary to the suggestions in the prior art, when the genes for a chaperone protein 
and a heterologous protein are co-expressed on a 2|^m-family multi-copy plasmid 
in yeast, the production of the heterologous protein is substantially increased. 

15 

Our earlier application, which has been published as WO 2005/061718, from 
which this application claims priority, disclosed a method for producing 
heterologous protein comprising: 

20 (a) providing a host cell comprising a 2pm-family plasmid, the 

plasmid comprising a gene encoding a protein comprising the sequence of 
a chaperone protein and a gene encoding a heterologous protein; 

(b) culturing the host cell in a culture medium under conditions that 
25 allow the expression of the gene encoding the chaperone protein and the 

gene encoding a heterologous protein; 

(c) purifying the thus expressed heterologous protein from the culture 
medium; and 

30 
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(d) optionally, lyophilising the thus purified protein. 

As discussed above, Bao ei al, 2000, Yeast, 16, 329-341 reported that over- 
expression of the K. lactis PD1 gene KIPDI1 was toxic to K. lactis cells. Against 
5 this background we have surprisingly found that, not only is it possible to over- 
express PDI and other chaperones without the detrimental effects reported in Bao 
el al, but that two different chaperones can be recombinantly over-expressed in the 
same cell and, rather than being toxic, can increase the expression of proteins, 
including heterologous proteins, to levels higher than the levels obtained by 

10 individual expression of either of the chaperones. This was not expected. On the 
contrary, in light of the teaching of Bao et aL one would think that over- 
expression of two chaperones would be even more toxic than the over-expression 
of one. Moreover, in light of some initial findings which are also reported below 
in the present application, it was expected that the increases in heterologous 

15 protein expression obtained by co-expression with a single chaperone would be at 
the maximum level possible for the cell system used. Therefore, it was 
particularly surprising to find that yet farther increases in protein expression could 
be obtained by co-expression of two different chaperones with the protein. 

20 SUMMARY OF THE INVENTION 

Accordingly, as a first aspect of the present invention there is provided a method 
for producing a desired protein, such as a heterologous protein, comprising 
providing a host cell comprising a first recombinant gene encoding a protein 

25 comprising the sequence of a first chaperone protein, a second recombinant gene 
encoding a protein comprising the sequence of a second chaperone protein and a 
third gene (optionally the third gene being recombinant) encoding the desired 
protein (optionally a heterologous protein), wherein the first and second 
chaperones are different; and culturing the host cell in a culture medium under 

30 conditions that allow the expression of the first, second and third genes. 
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Optionally the thus expressed desired protein may, or may not, be purified from 
the cultured host cell or the culture medium. 

Optionally, the thus purified desired protein may, or may not, be lyophilised. 

The method may, or may not, further comprise the step of formulating the purified 
desired protein with a carrier or diluent and optionally presenting the thus 
formulated protein in a unit dosage form, in the manner discussed above. 

The term "recombinant gene" includes nucleic acid sequences that operate 
independently as "stand alone" expressible sequences to produce an encoded 
protein or, in the alternative, nucleic acid sequences introduced that operate in 
combination with endogenous sequences (such as by integration into an 
endogenous sequence so as to produce a nucleic acid sequence that is different to 
the endogenous sequence) within the host to cause increased expression of a target 
protein. 

A "recombinant gene" is typically a gene that is not naturally found in the context 
used. For example, a gene that is integrated, at an integration site, into the 
chromosome of a host organism can be said to be a "recombinant gene" if it 
comprises a sequence that does not naturally occur at the integration site. Thus, 
the "recombinant gene" may, or may not, comprise a non-natural sequence in the 
coding, regulatory or any other region of the gene, or may, or may not, comprise 
the sequence of a naturally occurring gene but be introduced into the chromosome 
of a host organism at an integration site at which that sequence does not naturally 
occur. The same issues apply, mutatis mutandis, to the insertion of a 
"recombinant gene" into a plasmid. 

The terms "chromosomally integrated" and "integrated into the chromosome of 

the host cell" are well recognised terms of the art. For avoidance of doubt, these 

terms include the integration of polynucleotide sequences in any inheritable 

7 
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nuclear material that naturally occurs in a host cell, other than for naturally 
occurring plasmids. Thus, a polynucleotide sequence that is "integrated into the 
chromosome of the host cell" may, or may not, be integrated into the chromosome 
of a procaryotic (such as a bacterial) cell, or into any part of the genome of a 
eucaryotic cell, such as into nuclear genetic material including the chromosome 
(or, one of the chromosomes), the mitochondrial genome or the chloroplast 
genome. 

The first and second chaperones may, or may not, each individually, be one of the 
specifically listed chaperones as discussed below, and are a combination of 
chaperones that, when co-expressed in the same host cell, provide at least an 
additive effect to the increase in expression of the desired protein. By "additive 
effect" we mean that the level of expression of the desired protein in the host cell 
is higher when the first and second recombinant genes are simultaneously co- 
expressed with the third gene as compared to the same system wherein (i) the first 
recombinant gene is co-expressed with the third gene in the absence of the 
expression of the second recombinant gene and (ii) the second recombinant gene 
is co-expressed with the third gene in the absence of the expression of the first 
recombinant gene. 

The term "chaperone" as used herein refers to a protein that binds to and stabilises 

an otherwise unstable conformer of another protein, and by controlled binding and 

release, facilitates its correct fate in vivo, be it folding, oligomeric assembly, 

transport to a particular subcellular compartment, or disposal by degradation. 

Accordingly a chaperone is also a protein that is involved in protein folding/or 

which has chaperone activity or is involved in the unfolded protein response.- 

Chaperone proteins of this type are known in the art, for example in the Stanford 

Genome Database (SGD), http://db.yeastgenome.org or 

http://www.yeastgenome.org. Preferred chaperones are eucaryotic chaperones, 

especially preferred chaperones are yeast chaperones, including AHA], CCT2, 

CCT3. CCT4, CCT5, CCT6, CCT7, CCT8. CNSJ, CPR3, CPR6, EROl, EUG1, 

8 
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FMOl, HCH1, HSP10, HSP12, HSP.l 04, HSP26, HSP30, HSP42, HSP60. 
HSP78, HSP82.JEM1, MDJ1, MDJ2, MP DP, MPD2, PDI1, PFD1, ABC1, APJ1, 
ATP] 1, ATP 12, BTT1 . CDC 37, CPR7, HSC82, KAR2, LHS1, MGE1, MRS IP 
NOB1, ECM10, SSAJ, SSA2, SSA3, SSA4, SSC1, SSE2, SIL1, SLSP ORMJ, 
5 0RM2. PERI, PTC2, PSEP UB14 and HAC1 or a truncated intronless HAC1 
(Valkonen et al. 2003, Applied Environ. Micro., 69, 2065), as well as TIM9, 
PAM18 (also known as TIM14) and TCP1 (also known as CCT1) 

A chaperone useful in the practice of the present invention may, or may not, be: 

10 

• a heat shock protein, such as a protein that is a member of the hsp70 
family of proteins (including Kar2p, SSA and SSB proteins, for example 
proteins encoded by SSA1, SSA2, SSA3, SSA4, SSB1 and SSB2), a protein 
that is a member of the HSP90-family, or a protein that is a member of the 

15 HSP40-family or proteins involved in their modulation (e.g. Sillp), 

including DNA-J and DNA-J-like proteins (e.g. Jemlp, Mdj2p); 

• a protein that is a member of the karyopherin/importin family of proteins, 
such as the alpha or beta families of karyopherin/importin proteins, for 

20 example the karyopherin beta protein PSE1 ; 

• a protein that is a member of the ORMDL family described by Hjelmqvist 
et al, 2002, Genome Biology, 3(6), research0027. 1-0027. 16, such as 
Orm2p. 

25 

• a protein that is naturally located in the endoplasmic reticulum or 
elsewhere in the secretory pathway, such as the golgi. For example, a 
protein that naturally acts in the lumen of the endoplasmic reticulum (ER), 
particularly in secretory cells, such as PD1 
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a protein that is transmembrane protein anchored in the ER 5 such as a 
member of the ORMDL family described by Hjelmqvist et aL 2002, supra, 
(for example, Orm2p); 

a protein that acts in the cytosol, such as the hsp70 proteins, including SSA 
and SSB proteins, for example protein production SSA J, SSA2, SSA3 } 
SSA4 } SSB J and SSB2; 

a protein that acts in the nucleus, the nuclear envelope and/or the 
cytoplasm, such as Pselp: 

a protein that is "essential" to the viability of the cell, such as PDI, or a 
protein encoded by one of the following genes: CCT2, CCT3, CCT4, 
CCT5, CCT6, CCT7, CCT8, CMS/, EROl, HSP10 9 HSP60, PDI1, CDC 3 7, 
MGE1, MRSU, NOB1, SSC1, TIM9, PAM18 and TCP1, or a 
protein that is an essential karyopherin protein, such as Pselp; 

a protein that is involved in sulphydryl oxidation or disulphide bond 
formation, breakage or isomerization, or a protein that catalyses 
thiokdisulphide interchange reactions in proteins, particularly during the 
biosynthesis of secretory and cell surface proteins, such as protein 
disulphide isomerases (e.g. Pdilp, Mpdlp), homologues (e.g. Euglp) 
and/or related proteins (e.g. Mpd2p, Fmolp, Erolp); 

a protein that is involved in protein synthesis, assembly or folding, such as 
PDI and Ssalp; 

a protein that binds preferentially or exclusively to unfolded, rather than 
mature protein, such as the hsp70 proteins, including SSA and SSB 
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proteins, for example proteins encoded by SSA1, SSA2, SSA3, SSA4, SSB1 
and SSB2; 

a protein that prevents aggregation of precursor proteins in the cytosol, 
such as the hsp70 proteins, including SSA and SSB proteins, for example 
proteins encoded by SSA J, SSA2, SSA3, SSA4, SSB1 and SSB2: 

a protein that binds to and stabilises damaged proteins, for example Ssalp; 

a protein that is involved in the unfolded protein response or provides for 
increased resistance to agents (such as tunicamycin and dithiothreitol) that 
induce the unfolded protein response, such as a member of the ORMDL 
family described by Hjelmqvist et al, 2002, supra (for example, Orm2p) or 
proteins involved in the response to stress (e.g. Ubi4p); 

a protein that is a co-chaperone and/or a protein indirectly involved in 
protein folding and/or the unfolded protein response (e.g. hspl04p, 
Mdjlp); 

a protein that is involved in the nucleocytoplasmic transport of 
macromolecules, such as Pselp; 

a protein that mediates the transport of macromolecules across the nuclear 
membrane by recognising nuclear location sequences and nuclear export 
sequences and interacting with the nuclear pore complex, such as PSE1 ; 

a protein that is able to reactivate ribonuclease activity against RNA of 
scrambled ribonuclease as described in as described in EP 0 746 611 and 
Hillson et al, 1 984, Methods Enzymol, 107, 281-292, such as PDI; 
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10 



e a protein that has an acidic pi (for example, 4.0-4.5), such as PDI; 

a protein that is a member of the Hsp70 family, and optionally possesses 
an N-terminal ATP-binding domain and a C-terminal peptide-binding 
domain, such as Ssalp. 

• a protein that is a peptidyl-prolyl cis-trans isomerases (e.g. Cpr3p, Cpr6p); 

• a protein that is a homologue of known chaperones (e.g. Hspl Op); 

• a protein that is a mitochondrial chaperone (e.g Cpr3p); 

• a protein that is a cytoplasmic or nuclear chaperone (e.g Cnslp); 

15 • a protein that is a membrane-bound chaperone (e.g. Orm2p, Fmolp); 

a protein that has chaperone activator activity or chaperone regulatory 
activity (e.g. Ahalp, Haclp, Hchlp); 

20 • a protein that transiently binds to polypeptides in their immature form to 
cause proper folding transportation and/or secretion, including proteins 
required for efficient translocation into the endoplasmic reticulum (e.g. 
Lhslp) or their site of action within the cell (e.g. Pselp); 

25 • a protein that is a involved in protein complex assembly and/or ribosome 
assembly (e.g. Atpl lp, Pselp, Noblp); 

• a protein of the chaperonin T-complex (e.g. Cct2p); 

30 • a protein of the prefoldin complex (e.g. Pfdlp); 

12 
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a mitochondrial intermembrane space protein such as Tim9p; 

a protein that can form a complex, in vivo, with Mrsl Ip/TimlOp, such as 
Tim9p; 

a protein that is involved in the mediation of import and insertion of 
polytopic inner membrane proteins, such as Tim9p; 

a protein that can prevent the aggregation of incoming proteins, such as 
Tim9p; 

a protein that can be a functional constituent of the mitochondrial import 
motor associated with presequence translocase (along with Ssclp, Tim44p, 
Mgelp and Paml6p) such as Paml8p: 

a protein that can stimulate the ATPase activity of Ssclp 5 such as to drive 
mitochondrial import, such as PamlSp; 

a protein that contains a J domain, such as Paml 8p; 

a protein that can act as an alpha subunit of chaperonin-containing T- 
complex, which mediates protein folding in the cytosol, such as Tcplp; 

a protein that can play a role in the in maintenance of an actin 
cytoskeleton, such as Tcplp; or 

a protein that is, or is a homolog to, a Drosophila melanogaster or mouse 
tailless complex polypeptide, such as Tcplp. 
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A preferred chaperone is protein disulphide isomerase (PDI) or a fragment or 
variant thereof having an equivalent ability to catalyse the formation of disulphide 
bonds within the lumen of the endoplasmic reticulum (ER). By "PDI" we include 
any protein having the ability to reactivate the ribonuclease activity against RNA 
of scrambled ribonuclease as described in EP 0 746 611 and Hillson el al, 1984, 
Methods Enzymol. , 107, 28 1-292. 

PDI is an enzyme which typically catalyzes thiohdisulphide interchange reactions, 
and is a major resident protein component of the ER lumen in secretory cells. A 
body of evidence suggests that it plays a role in secretory protein biosynthesis 
(Freedman, 1984, Trends Biochem. Set, 9, 438-41) and this is supported by direct 
cross-linking studies in situ (Roth and Pierce, 1987, Biochemistry, 26, 4179-82). 
The finding that microsomal membranes deficient in PDI show a specific defect in 
cotranslational protein disulphide (Bulleid and Freedman, 1988, Nature, 335, 649- 
51) implies that the enzyme functions as a catalyst of native disulphide bond 
formation during the biosynthesis of secretory and cell surface proteins. This role 
is consistent with what is known of the enzyme's catalytic properties in vitro: it 
catalyzes thiol: disulphide interchange reactions leading to net protein disulphide 
formation, breakage or isomerization, and can typically catalyze protein folding 
and the formation of native disulphide bonds in a wide variety of reduced, 
unfolded protein substrates (Freedman et al., 1989, Biochem. Soc: Symp., 55, 167- 
192). PDI also functions as a chaperone since mutant PDI lacking isomerase 
activity accelerates protein folding (Hayano et al, 1995, FEBS Letters, 377, 505- 
511). Recently, sulphydryl oxidation, not disulphide isomerisation was reported 
to be the principal function of Protein Disulphide Isomerase in S. cerevisiae 
(Solovyov et al, 2004, J. Biol. Chem., 279 (33) 34095-34100). The DNA and 
amino acid sequence of the enzyme is known for several species (Scherens et al, 
1991, Yeast, 1, 185-193; Farquhar et al, 1991, Gene, 108, 81-89; EP074661; 
EP0293793; EP0509841) and there is increasing information on the mechanism of 
action of the enzyme purified to homogeneity from mammalian liver (Creighton et 
al, 1980. J. Mol. Biol, 142, 43-62; Freedman et al, 1988, Biochem. Soc. Trans., 
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16, 96-9: Gilbert, 1989, Biochemistry, 28, 7298-7305; Lundstrom and Holmgren, 
1990, J. Biol. Chem., 265, 9114-9120: Hawkins and Freedman, 1990, Biochem. J., 
275, 335-339). Of the many protein factors currently implicated as mediators of 
protein folding, assembly and translocation in the cell (Rothman, 1989, Cell, 59, 
591-601), PDI has a well-defined catalytic activity. 

The deletion or inactivation of the endogenous PDI gene in a host results in the 
production of an inviable host. In other words, the endogenous PDI gene is an 
"essential" gene. 



PDI is readily isolated from mammalian tissues and the homogeneous enzyme is a 
homodimer (2x57 kD) with characteristically acidic pi (4.0-4.5) (Hillson et al, 
1984, op. cit.). The enzyme has also been purified from wheat and from the alga 
Chlamydomonas reinhardii (Kaska et al, 1990, Biochem. J., 268, 63-68), rat 

15 (Edman et al, 1985, Nature, 317, 267-270), bovine (Yamauchi et al, 1987, 
Biochem. Biophys. Res. Comm., 146, 1485-1492), human (Pihlajaniemi et al, 
1987, EMBO J., 6, 643-9), yeast (Scherens et al, supra; Farquhar et al, op. cit.) 
and chick (Parkkonen et al, 1988, Biochem. ./., 256, 1005-1011). The proteins 
from these vertebrate species show a high degree of sequence conservation 

20 throughout and all show several overall features first noted in the rat PDI sequence 
(Edman et al, 1985, op. cit.). 

Preferred PDI sequences include those from humans and those from yeast species, 
such as S. cerevisiae. 



A yeast protein disulphide isomerase precursor, PDI1, can be found as Genbank 
accession no. CAA42373 or BAA00723 and has a sequence of 522 amino acids as 
described in WO 2005/061718, the contents of which are incorporated herein by 
reference. 
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An alternative yeast protein disulphide isomerase sequence can be found as 
Genbank accession no. CAA38402, which has a sequence of 530 amino acids as 
described in WO 2005/061718, the contents of which are incorporated herein by 
reference. 

The alignment of these sequences (the sequence of Genbank accession no. 
CAA42373 or BAA00723 first, the sequence of Genbank accession no. 
CAA38402 second) in WO 2005/061718, the contents of which are incorporated 
herein by reference, shows that the differences between these two sequences are a 
single amino acid difference at position 114 (highlighted in bold) and that the 
sequence defined by Genbank accession no. CAA38402 contains the additional 
amino acids EADAEAEA at positions 506-5 13. 

Variants and fragments of the above PDI sequences, and variants of other naturally 
occurring PDI sequences are also included in the present invention. A "variant", in 
the context of PDI, refers to a protein wherein at one or more positions there have 
been amino acid insertions, deletions, or substitutions, either conservative or non- 
conservative, provided that such changes result in a protein whose basic properties, 
for example enzymatic activity (type of and specific activity), thermostability, 
activity in a certain pH-range (pH-stability) have not significantly been changed. 
"Significantly" in this context means that one skilled in the art would say that the 
properties of the variant may, or may not, still be different but would not be 
unobvious over the ones of the original protein. 

25 By "conservative substitutions" is intended combinations such as Val, He, Leu, Ala, 
Met; Asp, Glu; Asn, Gin; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp. 
Preferred conservative substitutions include Gly, Ala; Val, He, Leu; Asp, Glu; Asn, 
Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 

30 A "variant" typically has at least 25%, at least 50%, at least 60% or at least 70%, 
preferably at least 80%, more preferably at least 90%, even more preferably at least 
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95%, yet more preferably at least 99%, most preferably at least 99.5% sequence 
identity to the polypeptide from which it is derived. 

The percent sequence identity between two polypeptides may be determined using 
5 suitable computer programs, as discussed below. Such variants may, or may not, 
be natural or made using the methods of protein engineering and site-directed 
mutagenesis as are well known in the art. 

A "fragment", in the context of PDI, refers to a protein wherein at one or more 
10 positions there have been deletions. Thus the fragment may, or may not, comprise at 
most 5, 10, 20, 30, 40 or 50%, typically up to 60%, more typically up to 70%, 
preferably up to 80%, more preferably up to 90%, even more preferably up to 95%, 
yet more preferably up to 99% of the complete sequence of the fiill mature PDI 
protein. Particularly preferred fragments of PDI protein comprise one or more whole 
1 5 domains of the desired protein. 

A fragment or variant of PDI may, or may not, be a protein that, when expressed 
recombinant] y in a host cell, can complement the deletion of the endogenously 
encoded PDI gene in the host cell, such as S. cerevisiae, and may, or may not, for 
20 example, be a naturally occurring homolog of PDI, such as a homolog encoded by 
another organism, such as another yeast or other fungi, or another eucaryote such 
as a human or other vertebrate, or animal or by a plant. 

Where the first chaperone is PDI, particularly mammalian or yeast PDI, then in 
25 one embodiment the second chaperone is not an hsp70 chaperone protein (such as 
yeast KAR2, HSP70, BiP, SSA1-4, SSB1, SSC1, SSD1 or a eucaryotic hsp70 
protein such as HSP68, HSP72, HSP73 ? HSC70, clathrin uncoating ATPase, IgG 
heavy chain binding protein (BiP), glucose-regulated proteins 75, 78 and 80 
(GPR75, GRP78 and GRP80) and the like). Specifically in one embodiment the 
30 first chaperone is not yeast PDI when the second chaperone is yeast KAR2. 
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Specifically in another embodiment the first chaperone is not mammalian PDI 
when the second chaperone is mammalian BiP. 

Alternatively, where the first and second chaperones are, for example, PDI, 
5 particularly mammalian or yeast PDI, and an hsp70 chaperone protein as 
described above, respectively, then the desired protein may be a heterologous 
protein that may or may not be a protein selected from - 

• mammalian gene products such as enzymes, cytokines, growth factors, 
hormones, vaccines, antibodies and the like; erythropoietin, insulin, 

10 somatotropin, growth hormone releasing factor, platelet derived growth 

factor, epidermal growth factor, transforming growth factor a, 
transforming growth factor (3, epidermal growth factor, fibroblast growth 
factor, nerve growth factor, insulin-like growth factor I, insulin-like 
growth factor II, clotting Factor VIII, superoxide dismutase, a-interferon, 

15 y-interferon, interleukin-1, interleukin-2, interleukin-3, interleukin-4, 

interleukin-5, interleukin-6, granulocyte colony stimulating factor, multi- 
lineage colony stimulating activity, granulocyte -macrophage stimulating 
factor, macrophage colony stimulating factor, T cell growth factor, 
lymphotoxin and the like; or human gene products; 

20 • any gene product which can be used as a vaccine, including any structural, 
membrane-associated, membrane-bound or secreted gene product of a 
mammalian pathogen, including viruses, bacteria, single-celled or multi- 
celled parasites which can infect or attack a mammal, in particular viruses 
such as human immunodeficiency virus (HIV), R. rickettsii, vaccinia, 

25 Shigella, poliovirus, adenovirus, influenza, hepatitis A, hepatitis B, dengue 

virus, Japanese B encephalitis, Varicella zoster, cytomegalovirus, hepatitis 
A, rotavirus, as well as vaccines against viral diseases like Lyme disease, . 
measles, yellow fever, mumps, rabies, herpes, influenza, parainfluenza and 
the like; or bacteria such as Vibrio cholerae, Salmonella typhi, Bordetella 
30 pertussis, Streptococcus pneumoniae, Hemophilus influenza, Clostridium 
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tetani, Corymebacterium diphtherial Mycobacterium leprae, 
Neisseriaqonorrhoeae, Neisseriameningitidis, Coccidioides immitis and 
the like. 

5 Another preferred chaperone is a protein comprising the sequence of a protein 
encoded by the gene SSA1, or a fragment or variant thereof having an equivalent 
chaperone-like activity. SSAJ , also known as YG100, is located on chromosome I 
of the S. cerevisiae genome and is 1 .93-kbp in size. 

10 One published protein sequence of the protein encoded by the gene SSA1 is as 
described in WO 2005/061718, the contents of which are incorporated herein by 
reference. 

A published coding sequence for SSA1 is also described in WO 2005/061718, the 
15 contents of which are incorporated herein by reference, although it will be 
appreciated that the sequence can be modified by degenerate substitutions to 
obtain alternative nucleotide sequences which encode an identical protein product. 

The protein Ssalp belongs to the Hsp70 family of proteins and is resident in the 
20 cytosol. Hsp70s possess the ability to perform a number of chaperone activities; 
aiding protein synthesis, assembly and folding; mediating translocation of 
polypeptides to various intracellular locations, and resolution of protein 
aggregates (Becker & Craig, 1994, Eur. J. Biochem. 219, 11-23). Hsp70 genes 
are highly conserved, possessing an N-terminal ATP-binding domain and a C- 
25 terminal peptide-binding domain. Hsp70 proteins interact with the peptide 
backbone of, mainly unfolded, proteins. The binding and release of peptides by 
hsp70 proteins is an ATP-dependent process and accompanied by a 
conformational change in the hsp70 (Becker & Craig, 1 994, supra), 

30 Cytosolic hsp70 proteins are particularly involved in the synthesis, folding and 
secretion of proteins (Becker & Craig, 1994, supra). In S. cerevisiae cytosolic 
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hsp70 proteins have been divided into two groups; SSA (SSA 1-4) and SSB (SSB 
1 and 2) proteins, which are functionally distinct from each other. The SSA 
family is "essential" in that at least one protein from the group must be active to 
maintain cell viability (Becker & Craig, 1994, supra). Cytosolic hsp70 proteins 
bind preferentially to unfolded and not mature proteins. This suggests that they 
•prevent the aggregation of precursor proteins, by maintaining them in an unfolded 
state prior to being assembled into multimolecular complexes in the cytosol and/or 
facilitating their translocation to various organelles (Becker & Craig, 1994, 
supra). SSA proteins are particularly involved in posttranslational biogenesis and 
maintenance of precursors for translocation into the endoplasmic reticulum and 
mitochondria (Kim et ah, 1998, Proc. Natl. Acad. Sci. USA. 95, 12860-12865: 
Ngosuwan et ah, 2003, J. Biol. Chem. 278 (9), 7034-7042). Ssalp has been shown 
to bind damaged proteins, stabilising them in a partially unfolded form and 
allowing refolding or degradation to occur (Becker & Craig, 1994, supra; Glover 
& Lindquist, 1998, Cell. 94, 73-82). 

Demolder et ah 1994, J. Biotechnol, 32, 179-189 reported that over-expression of 
SSA1 in yeast provided for increases in the expression of a recombinant 
chromosomally integrated . gene encoding human interferon-p\ There is no 
suggestion that increases in heterologous gene expression could be achieved if 
SSA1 and human interferon-^ were to be encoded by recombinant genes on the 
same plasmid. In fact, in light of more recent developments in the field of over- 
expression of chaperones in yeast (e.g. Robinson et al, 1994, op. cit.; Hayano et 
al, 1995, op. cit.; Shusta et al, 1998, op. cit; Parekh & Wittrup, 1997, op. cit.; Bao 
& Fukuhara, 2001, op. cit.; and Bao et al, 2000, op. cit ) the skilled person would 
have been disinclined to express SSA] from a 2um-family plasmid at all, much 
less to express both SSA1 and a heterologous protein from a 2um-family plasmid 
in order to increase the expression levels of a heterologous protein. 
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Variants and fragments of is a protein comprising the sequence of a protein 
encoded by the gene SSAJ are also included in the present invention. A "variant' \ in 
the context of a protein encoded by the gene SSAL refers to a protein having the 
sequence of native Ssalp other than at one or more positions where there have been 
amino acid insertions, deletions, or substitutions, either conservative or non- 
conservative, provided that such changes result in a protein whose basic properties, 
for example enzymatic activity (type of and specific activity), thermostability, 
activity in a certain pH-range (pH-stability) have not significantly been changed. 
"Significantly" in this context means that one skilled in the art would say that the 
properties of the variant may still be different but would not be unobvious over the 
ones of the original protein. 

By "conservative substitutions" is intended combinations such as Val, He, Leu, Ala, 
Met; Asp, Glu; Asn, Gin; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp. 
Preferred conservative substitutions include Gly, Ala; Val, He, Leu; Asp, Glu; Asn, 
Gin; Ser, Thr; Lys, Arg: and Phe, Tyr. 

A "variant" of Ssalp typically has at least 25%, at least 50%, at least 60% or at least 
70%, preferably at least 80%, more preferably at least 90%, even more preferably at 
least 95%, yet more preferably at least 99%, most preferably at least 99.5% sequence 
identity to the sequence of native Ssalp. 

The percent sequence identity between two polypeptides may be determined using 
suitable computer programs, as discussed below. Such variants may, or may not, 
be natural or made using the methods of protein engineering and site-directed 
mutagenesis as are well known in the art. 

A "fragment", in the context of Ssalp, refers to a protein having the sequence of 

native Ssalp other than for at one or more positions where there have been deletions. 

Thus the fragment may, or may not, comprise at most 5, 10, 20, 30, -40 or 50%, 

typically up to 60%, more typically up to 70%, preferably up to 80%, more 
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preferably up to 90%, even more preferably up to 95%, yet more preferably up to 
99% of the complete sequence of the full mature Ssalp protein. Particularly 
preferred fragments of SSA1 protein comprise one or more whole domains of the 
desired protein. 

5 

A fragment or variant of Ssalp may, or may not 5 be a protein that, when expressed 
recombinant!}' in a host cell, such as S. cerevisiae, can complement the deletion of 
the endogenously encoded SSA1 gene (or homolog thereof) in the host cell and 
may, or may not, for example, be a naturally occurring homolog of Ssalp, such as 
10 a homolog encoded by another organism, such as another yeast or other fungi, or 
another eucaryote such as a human or other vertebrate, or animal or by a plant. 

Another preferred chaperone is protein comprising the sequence of a protein 
encoded by the PSE1 gene, or a fragment or variant thereof having equivalent 
15 chaperone-like activity. 

PSE1 9 also known as KAP121, is an essential gene, located on chromosome XIII. 

A published protein sequence for the protein Pselp is as described in WO 
20 2 005/06 1 7 1 8, the contents of which are incorporated herein by reference. 

A published nucleotide coding sequence of PSE1 is also described in WO 
2005/061718, the contents of which are incorporated herein by reference, although 
it will be appreciated that the sequence can be modified by degenerate 
25 substitutions to obtain alternative nucleotide sequences which encode an identical 
protein product. 

The PSE1 gene is 3.25-kbp in size. Pselp is involved in the nucleocytoplasmic 

transport of macromolecules (Seedorf & Silver, 1997, Proc. Natl Acad. Sci. USA. 

30 94, 8590-8595). This process occurs via the nuclear pore complex (NPC) 

embedded in the nuclear envelope and made up of nucleoporins (Ryan & Wente, 
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2000, Curr. Opin. Cell Biol. 12, 361-371). Proteins possess specific sequences 
that contain the information required for nuclear import, nuclear localisation 
sequence (NLS) and export, nuclear export sequence (NES) (Pemberton et al, 
1998, Curr. Opin. Cell Biol. 10, 392-399). Pselp is a karyopherin/importin, a 
group of proteins, which have been divided up into a and p families. 
Karyopherins are soluble transport factors that mediate the transport of 
macromolecules across the nuclear membrane by recognising NLS and NES, and 
interact with and the NPC (Seedorf & Silver, 1997, supra; Pemberton el al'., 1998, 
supra; Ryan & Wente, 2000, supra). Translocation through the nuclear pore is 
driven by GTP hydrolysis, catalysed by the small GTP-binding protein, Ran 
(Seedorf & Silver, 1997, supra). Pselp has been identified as a karyopherin p. 14 
karyopherin p proteins have been identified in S. cerevisiae, of which only 4 are 
"essential". This is perhaps because multiple karyopherins may mediate the 
transport of a single macromolecule (Isoyama et ah, 2001, J. Biol. Chem. 276 
(24). 21863-21869). Pselp is localised to the nucleus, at the nuclear envelope, 
and to a certain extent to the cytoplasm. This suggests the protein moves in and 
out of the nucleus as part of its transport function (Seedorf & Silver, 1997, supra). 
Pselp is involved in the nuclear import of transcription factors (Isoyama et al., 
2001, supra; Ueta et al, 2003, J. Biol. Chem. 27S (50), 50120-50127), histones 
(Mosammaparast et al., 2002, J. Biol. Chem. 277 (1), 862-868), and ribosomal 
proteins prior to their assembly into ribosomes (Pemberton et al, 1998, supra). It 
also mediates the export of mRNA.from the nucleus. Karyopherins recognise and 
bind distinct NES found on RNA-binding proteins, which coat the RNA before it 
is exported from the nucleus (Seedorf & Silver, 1997, Pemberton et al, 1998, 
supra). 

As nucleocytoplasmic transport of macromolecules is essential for proper 
progression through the cell cycle, nuclear transport factors, such as Pselp are 
novel candidate targets for growth control (Seedorf & Silver, 1997, supra). 
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Overexpression of Pselp (protein secretion enhancer) in S. cerevisiae has also 
been shown to increase endogenous protein secretion levels of a repertoire of 
biologically active proteins (Chow et aL, 1992; J. Cell. Scl 101 (3), 709-719). 
There is no suggestion that increases in heterologous gene expression could be 

5 achieved if Pselp and a heterologous protein were both to be encoded by 
recombinant genes on the same plasmid. In fact, in light of more recent 
developments in the over-expression of chaperones in yeast (e.g. Robinson et al, 
1994, op. ext.; Hayano et al 9 1995, op. cit; Shusta et al, 1998, op. cit; Parekh & 
• Wittrup, 1997, op. cit.: Bao & Fukuhara, 2001, op: cit; and Bao et ah 2000, op. 

10 cit ) the skilled person would not have attempted to over-express a PSE1 gene 
from a 2^m-family plasmid at all, much less to express both Pselp and a 
heterologous protein from a 2|im-family plasmid in order to increase the 
expression levels of a heterologous protein. 

15 Variants and fragments of Pselp are also included in the present invention. A 
"variant", in the context of Pselp, refers to a protein having the sequence of native 
Pselp other than for at one or more positions where there have been amino acid 
insertions, deletions, or substitutions, either conservative or non-conservative, 
provided that such changes result in a protein whose basic properties, for example 

20 enzymatic activity (type of and specific activity), thermostability, activity in a certain 
pH-range (pH-stability) have not significantly been changed. "Significantly" in this- 
context means that one skilled in the art would say that the properties, of the variant 
may still be different but would not be unobvious over the ones of the original 
protein. 

25 

By "conservative substitutions" is intended combinations such as Val, He, Leu, Ala, 
Met; Asp, Glu; Asn, Gin; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp. 
Preferred. conservative substitutions include Gly, Ala; Val, He, Leu; Asp, Glu; Asn, 
Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 

30 

24 



WO 2006/067511 



PCT/GB2005/005085 



A "variant 55 of Pselp typically has at least 25% ; at least 50%, at least 60% or at least 
70%, preferably at least 80%, more preferably at least 90%, even more preferably at 
least 95%, yet more preferably at least 99%, most preferably at least 99.5% sequence 
identity to the sequence of native Pselp. 

5 

The percent sequence identity between two polypeptides may be determined using 
suitable computer programs, as discussed below. Such variants may, or may not, 
be natural or made using the methods of protein engineering and site-directed 
mutagenesis as are well known in the art. 

A "fragment 55 , in the context of Pselp, refers to a protein having the sequence of 
native Pselp other than for at one or more positions where there have been deletions. 
Thus the fragment may, or may not, comprise at most 5, 10, 20, 30, 40 or 50%, 
typically U P t0 60 ° /o > more typically U P t0 70% > preferably up to 80%, more 
15 preferably up to 90%, even more preferably up to 95%, yet more preferably up to 
99% of the complete sequence of the full mature Pselp protein. Particularly 
preferred fragments of Pselp protein comprise one or more whole domains of the 
desired protein. 

20 A fragment or variant of Psel p may, or may not, be a protein that, when expressed 
recombinant^ in a host cell, such as S. cerevisiae, can complement the deletion of 
the endogenous PSE1 gene in the host cell and may, or may not, for example, be a 
naturally occurring homolog of Pselp, such as a homolog encoded by another 
organism, such as another yeast or other fungi, or another eucaryote such as a 

25 human or other vertebrate, or animal or by a plant. 

Another preferred chaperone is a protein comprising the sequence of a protein 
encoded by the OM42 gene, or a fragment or variant thereof having equivalent 
chaperone-like activity. 

30 
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ORM2, also known as YLR350W. is located on chromosome XII (positions 
828729 to 829379) of the S. cerevisiae genome and encodes an evolutionarily 
conserved protein with similarity to the yeast protein Ormlp. Hjelmqvist el al, 
2002, Genome Biology, 3(6), research 0027.1-0027.16 reports that ORM2 belongs 
5 to gene family comprising three human genes (ORMDL1 , ORMDL2 and 
ORMDL3) as well as homologs in microsporidia, plants, Drosophila, urochordates 
and vertebrates. The ORMDL genes are reported to encode transmembrane 
proteins anchored in the proteins endoplasmic reticulum (ER). 

10 The protein Orm2p is required for resistance to agents that induce the unfolded 
protein response. Hjelmqvist et al, 2002 (supra) reported that a double knockout 
of the two S. cerevisiae ORMDL homologs (ORM1 and ORM2) leads to a 
decreased growth rate and greater sensitivity to tunicamycin and dithiothreitol. 

15 One published sequence of Orm2p is as described in WO 2005/061718, the 
contents of which are incorporated herein by reference. 

The above protein is encoded in S. cerevisiae by the coding nucleotide sequence 
also as described in WO 2005/061718, the contents of which are incorporated 
20 herein by reference, although it will be appreciated that the sequence can be 
modified by degenerate substitutions to obtain alternative nucleotide sequences 
which encode an identical protein product. 

Variants and fragments of Orm2p are also included in the present invention. A 

25 "variant", in the context of Orm2p, refers to a protein having the sequence of native 

Orm2p other than for at one or more positions where there have been amino acid 

insertions, deletions, or substitutions, either conservative or non-conservative, 

provided that such changes result in a protein whose basic properties, for example 

enzymatic activity (type of and specific activity), thermostability, activity in a certain 

30 pH-range (pH-stability) have not significantly been changed. "Significantly" in this 

context means that one skilled in the art would say that the properties of the variant 
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may still be different but would not be unobvious over the ones of the original 
protein. 

By "conservative substitutions'' is intended combinations such as Val, He, Leu, Ala, 
Met; Asp, Glu; Asn, Gin; Ser, Thr, Gly, Ala; Lys, Arg, His; and Phe, Tyr, Trp. 
Preferred conservative substitutions include Gly, Ala; Val, He, Leu; Asp, Glu; Asn, 
Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. 

A "variant" of Orm2p typically has at least 25%, at least 50%, at least 60% or at 
least 70%, preferably at least 80%, more preferably at least 90%, even more 
preferably at least 95%, yet more preferably at least 99%, most preferably at least 
99.5% sequence identity to the sequence of native Orm2p. 

The percent sequence identity between two polypeptides may be determined using 
suitable computer programs, as discussed below. Such variants may, or may not, 
be natural or made using the methods of protein engineering and site-directed 
mutagenesis as are well known in the art. 

A "fragment", in the context of Orm2p, refers to a protein having the sequence of 
native Orm2p other than for at one or more positions where there have been 
deletions. Thus the fragment may, or may not, comprise at most 5, 10, 20, 30, 40 or 
50%, typically up to 60%, more typically up to 70%, preferably up to 80%, more 
preferably up to 90%, even more preferably up to 95%, yet more preferably up to 
99% of the complete sequence of the . full mature Orm2p protein. Particularly 
preferred fragments of Orm2p protein comprise one or more whole domains of the 
desired protein. 

A fragment or variant of Orm2p may, or may not, be a protein that, when 

expressed recombinantly in a host cell, such as S. cerevisiae, can complement the 

deletion of the endogenous ORM2 gene in the host cell and may, or may not, for 

example, be a naturally occurring homolog of Orm2p, such as a homolog encoded 
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by another organism, such as another yeast or other fungi, or another eucaryote 
such as a human or other vertebrate, or animal or by a plant. 

A gene encoding a protein comprising the sequence of a chaperone may, or may 
5 not, be fonned in a like manner to that discussed below for genes encoding 
heterologous proteins, with particular emphasis on combinations of ORFs and 
regulatory regions. 

Thus, one preferred chaperone is protein disulphide isomerase; another preferred 
10 chaperone is Orm2p or a fragment or variant thereof. In a particularly preferred 
embodiment, the first and second chaperones are protein disulphide isomerase and 
Orm2p or a fragment or variant thereof. 

Further preferred combinations for the first and second chaperones, respectively, 

15 may, or may not, be encoded by the genes AHA J and CCT2; AHA J and CCT3; 
AHA1 and CCT4; AHA] and CCT5; AHA J and CCT6; AHA1 and CCT7; AHA J 
and CCT8; AHA J and CNSJ; AHA1 and CPR3; AHA] and CPR6; AHA] and 
EROl; AHA] and EUG1; AHA] and FMOl; AHA] and HCH1; AHA] and HSP10; 
AHA] and HSP12; AHA] and HSP104; AHA] and HSP26; AHA] and HSP30; 

20 AHA] and HSP42; AHA] and HSP60; AHA] and HSP78; AHA] and HSP82; 
AHAJ and JEM]; AHA] and MDJ1; AHA] and MDJ2; AHA] and MPD1; AHA] 
and MPD2\ AHA] and PDJ1; AHAJ and PFD1; AHA] and ABC]; AHAJ and 
APJ1; AHA] and ATP]]; AHA] and ATP12; AHA] and BTT1; AHA] and CDC37; 
AHA] and CPR7; AHA] and HSC82; AHA] and KAR2; AHA] and LHS1; AHA] 

25 and MGE1 ; AHA 7 and MRS] 1 ; AHA 1 and NOB] ; AHA] and ECM1 0; AHA 1 and 
SSA1; AHA] and SSA2; AHA] and SSA3; AHA1 and SSA4; AHAJ and SSC]; 
AHA] and SSE2; AHAJ and SIL1; AHA] and SLSJ; AHAJ and 07?Mr ; AHA] and 
0ifM2; y4/£47 and PER]; AHA] and P7C2; ^/£47 and PSE1; AHA] and 
AHA] and i£4C7 or a truncated intronless HAC1; CCT2 and CCT3; CCT2 and 

30 CCT2 and CCF5; CC77 and CCT6; CCT2 and CCT7; CCT2 and CC^S; 

CC77 and CNS1; CCT2 and CP7?5; CCT2 and CPR6; CCT2 and £i?07; CC72 and 
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EUG1; CCT2 and FMOl; CCT2 and HCHJ; CCT2 and HSP10: CCT2 and 

HSP12; CCT2 and HSP104; CCT2 and HSP26; CCT2 and HSP30; CCT2 and 

HSP42: CCT2 and HSP60: CCT2 and 72SP7S; CC72 and HSP82: CCT2 and 

J757W7; CCT2 and MDJ1 ; CC72 and MDJ2: CCT2 and MPD1 ; CC72 and MPD2: 

CCT2 and PDIF, CCT2 and PFD1: CCT2 and ABC7; CC72 and ,47>77; CCT2 and 

^rPii; CC72 and ATP22: CCT2 and 737T7; CC72 and CDC 3 7: CCT2 and CP7?7; 

CC72 and 77SCS2; CC72 and 7C47J2; CCT2 and Z77S7; CC72 and MGE1; CCT2 

and M7W77; CC72 and NOBJ; CCT2 and 75CM70; CCT2 and S&47; CC72 and 

SSA2; CCT2 and S&43; CC72 and SSA4: CCT2 and SSC7; CCT2 and SSE2; CCT2 

and 5717; CC72 and SLS7; CC72 and ORMJ; CCT2 and OKM2; CCT2 and 7>£7?7; 

CC72 and PTC2; CCT2 and PS£7; CC72 and UBI4; CCT2 and 7£4C7 or a 

truncated intronless 7£4C7; CCT3 and CC7W; CC73 and CCT5: CCT3 and CCP6; 

CC73 and CCT7; CCT3 and CCTS; CCT3 and CTvW; OT3 and CPR3; CCT3 and 

CP7?(5; CC73 and EROl; CCT3 and £l/G7; CCT3 and FM07; CC73 and 77C777; 

CCT3 and 7fS7>70; CC73 and 775P72; CCT3 and 7KP7(W; CCT3 and 77SP26; 

CC73 and HSP30; CCT3 and 77S7W2; CC73 and HSP60; CCT3 and T/STVS; 

CCT3 and 77S/>S2; CCP3 and JEMJ; CCT3 and MD.77; CC73 and MDJ2- CCT3 

and MP737; CCT3 and MP732; CC73 and 7>D77; CCT3 and PFD1; CCT3 and 

^73C7; CCO and /17V7; CCT3 and ,477>J7; OT3 and ATP12; CCT3 and 737T7; 

CC73 and CDC37; CCT3 and CP7?7; CCT3 and 77SCS2; CCP3 and K4R2; CCT3 

and Z77S7; CCT3 and MGE1; CCT3 and M7?577; CCT3 and A^<9737; CCT3 and 

£CM70; CCT3 and S&47; CCF3 and SSA2; CCT3 and S&43; CCT3 and S&44; 

CCT3 and SSC7; CCT3 and SS£2; CCT3 and SIL1; CCT3 and SLS7; CCT3 and 

OPJW7; CC73 and 07?M2; CCT3 and P£7?7; CC73 and PTC2; CCT3 and PST^; 

CCT3 and £773W; CC73 and 774C7 or a truncated intronless HAC1; CCT4 and 

CCT5; CCT4 and CCPd; CC7V and CCT7; CCT4 and CCrS; CCT4 and C7VS7; 

CC^ and CP7?3; CCT4 and CPT?^ CCT4 and £7?C7; CCr4 and ££7G7; CCT4 

and FMOl; CCT4 and HCH1; CCT4 and 77SP70; CC7V and HSP12; CCT4 and 

7/S7>7tW; CCT4 and 775 , P25; CC7V and HSP30; CCT4 and 77XP42; CC7V and 

HSP60; CCT4 and 77SP7S; CCT4 and 775P52; CC7V and JEM1: CCT4 and MDJ1; 

CCT4 and MDJ2; CCT4 and MP DP, CCT4 and MPD2; CCT4 and 7>Z>77; CC^ 
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and PFD1; CCT4 and ABC J: CCT4 and APJ1; CCT4 and ATP11; CCT4 and 

ATP 12: CCT4 and BTT1: CCT4 and CDC37; CCT4 and CPR7; CCT4 and HSC82; 

CCT4 and KAR2; CCT4 and £7757; CCT4 and MGP7; COT and MRS 11; CCT4 

and 7VCP7; COT and ECM10; CCT4 and 5547; CCT4 and 55^2; COT and 

5 5545; CCT4 and 5544; CCT4 and 55C7; COT and SSE2; CCT4 and 577,7; COT 

and 5L57; CCT4 and OPM7; CCT4 and <97?M2; COT and PERI; CCT4 and 

P7C2; COT and P5P7; CCT4 and C/B7*; COT and 77/JC7 or a truncated 

intronless HAC1; CCT5 and CCT6; CCT5 and COT; COT and CCT8; COT and 

C7V57; COT and CPR3; COT and CP7?6; CCT5 and PP<37; COT and EUG1; 

10 COT and PM97; CCT5 and 7/C777; COT and HSP 10; CCT5 and 775P72-; COT 

and HSP 1 04; CCT5 and HSP26; CCT5 and 7/5P50; COT and 7J5P42; COT and 

HSP60; CCT5 and 775P7S; COT and HSP82; CCT5 and JEM1; CCT5 and MPV7; 

COT and MD.J2; CCT5 and MP737; COT and MPD2; CCT5 and PP>77; CCT5 

and PFD1; CCT5 and .4PC7; COT and ,4P77; COT and ATP 11; CCT5 and 

15 ATP 12; CCT5 and 57T7 ; CCT5 and CDC3 7; COT and CPP. 7; COT and HSC82; 

CCT5 and 7C4P2; CCT5 and L7757; COT and MGE1; CCT5 and M7?577; COT 

and NOB1; CCT5 and PCM70; CCT5 and 5547; COT and SSA2; CCT5 and 

5545; COT and SSA4; CCT5 and 55C7; CCT5 and 55P2; COT and 5717; CCT5 

and 5Z57; COT and ORMJ; CCT5 and OPM2; CCT5 and PPP7; COT and 

20 PTC2; CCT5 and P5P7; CCT5 and £77374; COT and 7P4C7 or a truncated 

intronless 7£4C7; CCT6 and CC77; COT and COT; CCT6 and C7V57; CCT6 and 

CPP5; COT and CPR6; CCT6 and 7±7?<37; COT and EUG1; CCT6 and FMOl; 

CCT6 and 77C777; COT and 775P70; CCT6 and 775P72; COT and HSP 104; 

CCT6 and 775P26; COT and 7ZSP50; COT and HSP42; CCT6 and 775P^0; 

25 CCT6 and 775P7S; CCP<5 and HSP82; CCT6 and JEM1; CCT6 and MDJ1; CCT6 

and MDJ2; CCT6 and MPD1; CCT6 and MPD2; CCT6 and PD77; COT and 

PFDl; CCT6 and ABC7; COT and ^PJ7; COT and ^7P77; COT and ^PP72; 

COT and BTT1; CCT6 and CDC57; CCT6 and CP7?7; COT and 775CS2; COT 

and 7C47?2; COT and 17757; COT and 7wGP7; CCT6 and M7?577; COT and 

30 NOB1; CCT6 and ECM10; CCT6 and 5547; COT and SSA2; CCT6 and 5545; 

COT and SSA4; CCT6 and 55C7; CCT6 and 55P2; COT and 577,7; CCT6 and 
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SLS1; CCT6 and ORMP, CCT6 and 0RM2: CCT6 and PERI; CCTS and PTC2: 

CCT6 and PSE1; CCT6 and UBI4: CCT6 and HAC1 or a truncated intronless 

/£4C7: CCT7 and CCTS; CCT7 and CAW ; CC77 and CPi?5; CCT7 and CP/?<5; 

CC77 and EROJ; CCT7 and £L/G7; CC77 and FMOJ; CCT7 and //C/77; CCT7 

and HSP10; CCT7 and HSP12; CCT7 and HSP104; CCT7 and fl5P2(J; CCT7 and 

/7SP30; CC77 and CC77 and CC77 and 7f&P7S; CC77 and 

HSP82; CCT7 and J£M7; CCT7 and MZX/7: CC77 and MDJ2; CCT7 and MP DP, 

CCT7 and MPD2; CCT7 and PD/7: CC77 and PFDP, CCT7 and ABC7; CC77 

and ^PJi; CC77 and ATP1P, CCT7 and /J7T72: CCT7 and P7T7; CC77 and 

CDC37; CCT7 and CP7?7; CCT7 and 77SCS2; CC77 and 7C47?2; CCT7 and Z#W; 

CC77 and MGEJ; CCT7 and M7J577; CC77 and NOBJ; CCT7 and ECM10: CCT7 

and S&47; CC77 and S&42; CCT7 and CC77 and SSA4; CCT7 and SSC7; 

CC77 and SSE2; CCT7 and S/I7; CCT7 and SZW; CCP7 and ORMP CCT7 and 

0PM2; CC77 and PER]; CCT7 and P7C2; CCT7 and J>SEJ; CC77 and UBI4; 

CCT7 and 7Z4C7 or a truncated intronless 7Z4C7; CCZ5 and CAW; CCT8 and 

CP7?J; CCTS and CPPtf; CCTS and EROp CCT8 and ££7G7; CCTS and FMOl ; 

CC7S and LfC/77; CCT8 and J3KW0; CCTS and 77SP72; CCTS 1 and HSP104; 

CCT8 and 7JSP26"; CCTS and HSP30; CCTS and 77SP42; CC75 and HSP60; 

CCT8 and 7TSP7S: CCTS and /75P52; CCTO and JEMP, CCTS and MDJP, CCTS 

and MDJ2; CCT8 and MPD1 ; CCTS and MPD2; CCT8 and PDTi; CCTS and 

PFDP. CCT8 and ASC7; CCTS and APJ7; CCT8 and ,4T7>77; CCTS and ,47Pi2; 

CCTS and PTT7; CCTS and CDC37; CCTS and CP7?7; CCT8 and T7SCS2; CCTS 

and 7C4P2; CCT8 and X7TW; CCTS and MGW; CCT8 and MPW7; CC7.5 and 

A'OS7; CCTS and 7±CM70; CCT8 and S&47; CCTS and ££42; CCPS and SSA3; 

CCT8 and CCTS and SSC7; CCT8 and SKE2; CCT8 and 5/Z7; CCTS and 

SUSP, CCT8 and 07?M7; CCTS and ORM2; CCTS and PERP, CCT8 and PTC2; 

CCTS and PSEi; CCTS and CS74; CCTS and HAC1 or a truncated intronless 

HACP, CNS1 and CPP5; CNS1 and CP7M; CAW and £7?<?7; CAW and EUG1; 

CAW and FMOl ; CAW and 77C777; CAW and 7TSP/0; CAW and HSP12; CAW 

and 77SP704; CAW and HSP26; CNSJ and HSP30; CNS1 and 7TSP42; CAW and 

HSP60; CNSJ and 77SP7S CAW and HSP82; CNS1 and J£M7; CAW and MDJ1 ; 
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CAW and MD.J2; CNSJ and MPDJ; CNSJ and MPD2; CNSJ and PD11: CNSJ 

and PFDh CNSJ and ABCJ: CNSJ and APJJ: CNSJ and ATPJJ; CNSJ and 

^PP72; CAW and 5777; CAW and CDC37; CNSJ and CPP7; CAW and HSC82: 

CNSJ and 7C4P2; CAW and Z//S7: CAW and MGE1; CNSJ and MRS J J ; CAW 

and TO; CAW and ECMJO; CNSJ and S&47: CNSJ and S&42; CAW and 

SSA3; CNSJ and CAW and SSCJ; CNSJ and SSP2; CAW and SILJ; CNSJ 

and SLSJ; CNSJ and OPM7; CAW and ORM2: CNSJ and PFP7; CAW and 

PTC2; CNSJ and PSEJ: CNSJ and [75/4; CAW and HACJ or a truncated 

intronless HACJ; CPR3 and CPR6; CPR3 and EROJ; CPR3 and EUGJ: CPR3 

and FMOJ; CPR3 and 77CH7; CPP3 and 77SP70; CPR3 and 77SP72: CPP3 and 

HSPJ04; CPR3 and fJS/»2d; CPP3 and HSP30; CPR3 and 7YSP42; CP7?3 and 

HSP60; CPR3 and HSP78; CPR3 and 77SPS2; CPP3 and .7PM7; CPR3 and MDJJ; 

CPR3 and M73J2; CPP3 and MPDJ\ CPR3 and MP732: CP7?3 and PP>77; CP7?3 

and PF7)7; CPR3 and ^PC7; CPP3 and APJJ; CPR3 and ^PP77; CPP3 and 

^7P72; CP7?3 and 57T7; CPR3 and CDC37; CPR3 and CPR7; CPR3 and 77SCS2; 

CPP3 and KAR2; CPR3 and 7.7757: CPR3 and MGEJ; CPR3 and MRS 11; CPR3 

and 7VCW; CP7?3 and ECMJO; CPR3 and S&47; CPR3 and SSA2; CPR3 and 

SSA3; CPR3 and SS^; CPP3 and 55C7; CPR3 and SSE2; CPR3 and SILJ; CPR3 

and SLSJ; CPR3 and C7U47; CPP3 and ORM2; CPR3 and PFP7; CPR3 and 

PTC2; CPR3 and P5P7; CPR3 and CP74; CPP3 and 7£4C7 or a truncated 

intronless HACJ; CPR6 and EROJ; CPR6 and EUGJ; CPR6 and FM07: CPP5 

and 77C777; CP7?6 and HSPJO; CPR6 and 77SP72; CPR6 and 7ZSP704; CP7?<5 and 

HSP26; CPR6 and 775P30; CP7?tf and HSP42; CPR6 and TfSPtftf; CPR6 and 

HSP78; CPR6 and 775P52; CPR6 and JEM1; CPR6 and MP»J7; CPR6 and MDJ2; 

CPR6 and MPD1; CPR6 and MPD2; CPR6 and PD77; CPR6 and PFZ>7; CPP<5 

and ^PC7; CPTtt and ^PJ7; CPR6 and ^PP77; CP7?<5 and ^PP72; CPP(5 and 

PP77; CP7?<5 and C/X737; CPR6 and CP7?7; CPPtf and 775C52; CPR6 and 7C4P2; 

CPP5 and LHSJ; CPR6 and MGEJ; CPR6 and M/W7; CPP5 and NOB1; CPR6 

and £CM70; CPP6 and SSAJ; CPR6 and S&42; CP7?<5 and SSA3; CPR6 and SSA4; 

CPR6 and SSC7; CPP(5 and SSE2; CPR6 and 5/Z7; CPR6 and S7W; CP7M and 

ORMl CPR6 and OTM; CPR6 and PER J; CPR6 and PPC2; CPPtf and P5P7; 
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CPR6 and UB14; CPR6 and HAC1 or a truncated intronless HAC1: EROl and 

£GG7; EROl and FMOl: EROl and 77C777; EROl and HSP10; EROl and 

77S£72; ££07 and HSP104: EROl and 77S£2<5; EROl and 77S£50; ££07 and 

HSP42; EROl and 77S£60; ££07 and HSP78: EROl and 77S£S2: EROl and 

5 J£M7; ££G7 and MDJ1; EROl and MDJ2: EROl and MP£7; ££07 and MPD2; 

EROl and P£/7: £7?G7 and ££D7; £7?G7 and ,4£C7; ££07 and APJ1; EROl and 

,47£77; EROl and ATP12; EROl and 5777: ££G7 and CDC3 7; ££6>7 and 

C£7?7; ££07 and 77SCS2; ££G7 and 7£47?2; £7?07 and 777S7; ££07 and MGE1: 

EROl and MRS11; EROl and NOB 1\ EROl and £CM70; £7207 and SSA1; EROl 

10 and SSA2; EROl and S&43; £7? 01 and ££G7 and SSC1; EROl and SS£2; 

££G7 and 57X7; ££07 and SLS1; EROl and 07?M7; EROl and ORM2; EROl and 

££7G; £7?07 and PTC2; EROl and £S£7; ££07 and UB14: EROl and 774C7 or a 

truncated intronless HAC1; EUG1 and £M07; £L/G7 and 77C777; EUG1 and 

77S£70; EUG1 and 77S£72; £0G7 and HSP104: EUG1 and HSP26: EUG1 and 

15 HSP30; EUG1 and 77S7M2; £0G7 and HSP60; EUG1 and HSP78; EUG1 and 

77S£S2; £GG7 and J£M7; £GG7 and MDJ1; EUG1 and MDJ2; EUG1 and 

MPD1\ EUG1 and MPD2; EUG1 and £7)77; £GG7 and £££7; £GG7 and ^£C7; 

£GG7 and APJ1; EUG1 and ^££77; £GG7 and ^££72; £GG7 and £777; £GG7 

and CDC37; EUG1 and CPR7; £GG7 and HSC82; EUG1 and 7G4£2; £GG7 and 

20 777S7; £GG7 and MG£7; £GG7 and K4RSU; EUG1 and NOB1; EUG1 and 

£CM70; EUG1 and SSAl; £GG7 and SSA2; EUG1 and SSA3; EUG1 and SSA4; 

EUG1 and SSC1; EUG1 and SSE2; EUG1 and SIL1; EUG1 and SLS1; EUG1 and 

ORM1; EUG1 and ORAM; EUG1 and £££7; £GG7 and PTC2: EUG1 and £S£7; 

£GG7 and UB14: EUG1 and 774C7 or a truncated intronless 774C7; £M07 and 

25 77C777; £M07 and HSP10; FMOl and 77S£72; FMOl and HSP104; FMOl and 

77S£26"; £MG7 and HSP30; FMOl and HSP42; FMOl and 77S£o£; £M07 and 

77S£7S; FMOl and 775£52; £M07 and JEM1; FMOl and M7X7; £M07 and 

MDJ2; FMOl and MP£7; £MG7 and MPD2; FMOl and ££77; £MG7 and 

££D7; £M07 and ^£G7; £M07 and APJ1; £V707 and ATP 11; FMOl and 

30 ^7£72; FMOl and ££77; FMOl and CDC37: FMOl and C£7?7; £M07 and 

77SCS2: £M07 and 7G4£2; £M07 and 17757; £M07 and MG£7; £MG7 and 
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MRS11; FMOl and NOB1; FMOl and ECM10; FMOl and SSA1; FMOl and 

SSA2; FMOl and SSA3; FMOl and SSA4; FMOl and SSC1; FMOl and SSE2; 

FMOl and SIL1; FMOl and 5157; FM07 and 0RM1; FMOl and 0PM2; FMOl 

and PPP7; PM07 and PTC2; FMOl and PSP7; FM<97 and UB14: FMOl and 

7£4C7 or a truncated intronless 7£4C7; HCH1 and 7/SP70; 77C7P7 and HSP12; 

HCH1 and 77SP704; PTC777 and HSP26; HCH1 and 77SP50; HCH1 and 77SP42; 

77C777 and HSP60: HCH1 and 7/SP75; /7C/77 and HSP82: HCH1 and JPM7; 

HCH1 and MDJ1; HCH1 and MDJ2; 77C777 and MPD1; HCH1 and MPD2\ 

HCH1 and P73/7; 77C/77 and PF737; 7YC777 and ABC1; HCH1 and .4PJ7; 7/C777 

and ,4PP77; HCH1 and .4PP72; HCH1 and 5777; 77C777 and CDC37; HCH1 and 

CPP7; 7/C777 and 77SCS2; 77C7Y7 and KAR2: HCH1 and 777S7; HCH1 and 

MGE1; HCH1 and MRS11; HCH1 and NOB1; HCH1 and ECM10; HCH1 and 

SX47; 77C/77 and SSA2; HCH1 and 77C/77 and SSA4; HCH1 and 55C7; 

77C777 and SSE2; HCH1 and 5717; HCH1 and 5L57; #C//7 and ORM1: HCH1 

and OPM2; HCH1 and PERI; HCH1 and PTC2; 77C777 and PSE1; HCH1 and 

UBI4: HCH1 and 7£4C7 or a truncated intronless 7£4C7; HSP10 and 775P72; 

77SP70 and HSP104; HSP10 and 77SP26; 77SP70 and HSP30; HSP10 and 77SP42; 

HSP10 and 77SP60; 77SP70 and HSP78: HSP10 and HSP82; HSP10 and 7PM7; 

77SP70 and MDJ1; HSP10 and MDJ2; HSP10 and MPD1 ; /7SP70 and MPD2; 

HSP10 and P7)77; 77SP70 and PF737; 7/SP70 and ^PC7; 77SP70 and ^PJ7; 

775P70 and ^PP77; 775P70 and ^PP72; 775P70 and P7T7; HSP10 and CDC37; 

HSP10 and CP7?7; 77SP70 and 775C52; 775P70 and KAR2; 775P70 and LHS1; 

HSP10 and MGE1; HSP10 and MRS11; HSP10 and NOB1; 775P70 and ECM10; 

HSP10 and SSA1; HSP10 and SSA2; HSP10 and SSA3; HSP10 and SSA4; HSP10 

and SSC1; HSP10 and SSE2; HSP10 and SIL1;HSP10 and SLS1; HSP10 and 

ORM1; HSP10 and ORM2; HSP10 and PERI; HSP10 and PTC 2; HSP10 and 

PST7; 775P70 and Z7PJ4; 77SP70 and 7P4C7 or a truncated intronless 7£4C7; 

HSP1 2 and HSP104; HSP12 and HSP26; HSP12 and 77SP30; 775P72 and HSP42; 

775P72 and HSP60; HSP12 and 77SP7S; HSP12 and 7P5PS2; PISP12 and J EMI; 

HSP12 and MZ)J7; 775P72 and MDJ2; HSP12 and MP737; 775P72 and MPD2; 

HSP12 and PD77; 775P72 and PPX>7; 77SP72 and ABC1; 775P72 and APJ1; 

34 



WO 2006/067511 



PCT/GB2005/005085 



HSP12 and ATP1 7; HSP12 and ATP J 2; HSP12 and BTTJ; HSP12 and CDC37; 
HSP12 and CPR7; HSP12 and HSC82; HSP12 and KAR2: HSP12 and LHSJ; 
HSP12 and MGE1; HSP12 and M72577; 7/5P72 and NOBJ; HSP12 and PCM70; 
HSP12 and 55,47; 7/5P72 and 55.42; HSP12 and 55,45; //SP72 and SSA4; HSP12 

5 and SSC1; HSP12 and 55P2; 7ZSP72 and 57X7: 775P72 and 5L57: 7/5P72 and 
OKM7; 775P72 and ORM2; HSP12 and PPP7; 775P72 and PTC2; HSP12 and 
P5£7; HSPJ2 and C7S74: HSP12 and 7£4C7 or a truncated intronless HACJ; 
HSP104 and #5P26~; 7/5P704 and HSP30; HSP104 and 7/5P42; HSP104 and 
7ZSP60; 775P704 and HSP78; HSP104 and 775PS2; 775P704 and JEMJ; HSP104 

10 and MDJJ; HSP104 and MDJ2; HSP J 04 and MPD1 ; 7/5P704 and MP722; 
775P7(W and PD/7; 775P704 and PFDJ; HSP104 and ,4PC7; HSP104 and .4PJ7; 
7/5P7W and ,47P77; HSP104 and ,4PP72; 775P704 and BTTJ; HSP104 and 
CP>C37; 775P704 and CPR7; HSP104 and HSC82; HSP104 and 7C47?2: 775P704 
and 17757: HSP104 and MGE1; HSP104 and M7?577; 7/5P704 and NOBJ; 

15 775P704 and ECMJO; HSP104 and 55,47: 7/5P704 and SSA2; HSP104 and 

775P704 and SSA4; HSP104 and 55C7; HSP104 and 55E2; 775P704 and 5/17; 
775P704 and 5Z57; 775P704 and ORMJ; HSP104 and ORM2; HSPJ04 and P£7?7; 
HSPJ04 and PTC2\ HSPJ04 and PSEJ;HSPJ04 and UBJ4; HSP104 and i£4C7 or 
a truncated intronless 7£4C7; 775P26" and HSP30; HSP26 and 775P42; HSP26 and 

20 7/5P60: 775P26" and 775P75; 775P26" and 775P52; 775P2d and JEM! ; 775P26 and 
MD.77; HSP26 and MDJ2; HSP26 and MPDJ; HSP26 and MPD2; HSP26 and 
PP>/7; 775P2<5 and PPD7; 775P26 and ABC I; HSP26 and ,4PP7; HSP26 and 
,4PP77; HSP26 and ,4PP72; 775P26" and PP77; HSP26 and CDC37; HSP26 and 
CP7?7; 775P25 and 775C52; 775P26" and KAR2; HSP26 and LHS1; HSP26 and 

25 MGE1; HSP26 and MP577; 775P25 and NOBJ; HSP26 and ECMJO; HSP26 and 
55,47; 775P26" and 5&42; HSP26 and SSA3; HSP 2 6 and 55,44; /75P2<> and 55C7; 
HSP26 and SSE2;HSP26 and 5/Z7; 775P2d and 5L57; HSP26 and ORMJ; HSP26 
and OT?M2; HSP26 and PP7?7; 775P26 and PTC2; HSP26 and P5£7; 775P26 and 
UBI4; HSP 2 6 and HAC1 or a truncated intronless HAC1; HSP 30 and HSP 42; 

30 775P50 and TTSPtfO; 7/5P50 and' HSP78; HSP 30 and 775P52; 7/5P30 and JEM1; 

HSP30 and 714ZX/7; 775P50 and MDJ2; HSP30 and MPDJ; HSP30 and MPD2; 
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HSP30 and PD77; HSP30 and PFDJ: HSP30 and ABC1; HSP30 and APJJ: 

HSP30 and ATP IP, HSP30 and ATP 12: HSP30 and BTTJ: HSP30 and CDC37: 

HSP30 and CP/? 7; HSP30 and /7SCS2: 7/5P30 and KAR2: HSP30 and L//57; 

HSP30 and MGP7; 775P30 and MP577: HSP30 and 7VOB7; 7/5P30 and ECMJO; 

HSP30 and 55.47; 7/5P30 and SSA2; HSP30 and 5543: HSP30 and 55^*4; 7/5P30 

and 55C7; HSP30 and 55P2; 7/5P30 and 5/Z7; 775P30 and 5L57; HSP30 and 

ORM7; 7/5P30 and ORM2; HSP30 and P£P7; 7/5P30 and PTC2; HSP30 and 

P5£7; HSP30 and t/6/4; 7/5P30 and 774 C7 or a truncated intronless HAC1: 

HSP42 and HSP60; HSP42 and HSP78; HSP42 and 775PS2; 775P42 and JEM1 ; 

775P42 and MD77; 7KP42 and MLV2; 77^^/2 and MPP>7; HSP42 and 714PD2; 

HSP42 and PD77; /75P^2 and PFDJ; HSP42 and ^PC7; 7^SP4?2 and ^PJ7; 

HSP42 and ^PP77; HSP42 and ^PP72; 7KP42 and BTTJ: HSP42 and C/5C57; 

775P42 and CPR7; HSP42 and 7/5CS2; HSP42 and 7C4P2; HSP42 and XT/57; 

HSP42 and MGEP, HSP42 and MP577; T/SP42 and A>OP7; HSP42 and ECMJO: 

HSP42 and 5547; HSP42 and 5542; 7Y5P42 and 5543; HSP42 and 5544; 775P42 

and SSCJ; HSP42 and SSE2; HSP42 and 57Z7; 775P^2 and SLS1; HSP42 and 

ORMP, HSP42 and OPM2; 775P42 and PERI; HSP42 and PTC2: HSP42 and 

P5£7; 775P42 and £7PJ4; HSP42 and HAC1 or a truncated intronless 774C7; 

HSP60 and HSP78; HSP60 and 7/5PS2; 7/5P60 and J£M7; HSP60 and MPV7; 

HSP60 and MDJ2; HSP60 and MPLV; 77SP50 and MP7J>2; 775P50 and PD/7; 

HSP60 and PP737; HSP60 and ^PC7: 7/5P<50 and .4P./7; HSP60 and ^PP77; 

7P5P60 and 47P72; HSP60 and P7T7; 775P60 and CDC37; HSP60 and CPR7; 

HSP60 and HSC82; HSP60 and 7C4P2; 775P50 and ZP457; HSP60 and MG£7; 

7/5P60 and M7?577; HSP60 and 7^0P7; 775P<50 and ECMJO: HSP60 and 5547; 

HSP60 and 5542; 7/5Pd0 and 5543; HSP60 and 5544; /75PdO and 55C7; HSP60 

and 55P2; HSP60 and SILJ; 775P60 and 57,57; HSP60 and ORMP, HSP60 and 

ORM2; HSP60 and PERT, HSP60 and PTC 2; HSP60 and P5P7; 775P60 and 

L/7374; 775P<5C and 774C7 or a truncated intronless 7£4C7; HSP78 and HSP82; 

HSP78 and JEMP, HSP78 and MLV7; 7/5P7S and MPV2; 7/5P7S and MPD1 ; 

775P75 and MPD2; HSP78 and PD77; 775P75 and PPD7; 775P7S and /f/3C7; 

HSP78 and APJJ: HSP78 and ATP 11; HSP78 and ^PP72: HSP78 and BTTJ; 

36 



WO 2006/067511 



PCT/GB2005/005085 



HSP78 and CDC37: HSP78 and CPR7: HSP7S and HSC82: HSP78 and KAR2; 

HSP78 and LHS1: HSP78 and MGEh HSP78 and MRS1J; HSP78 and NOB1: 

HSP78 and £CMJ0; HSP78 and /7SP7S and SSA2: HSP78 and 5S^3; 

7/SP7S and SSA4: HSP78 and SSC7; HSP78 and &S£2; 7/SP7S and SIL1: HSP78 

and SLS7; 7TCP7S and ORMJ: HSP78 and OfcM2; HSP78 and 7>£P7; HSP7S and 

PTC2: HSP78 and PS£7; HSP78 and t/574; /7S7VS and 7£4C7 or a truncated 

intronless 7/4C7; HSP82 and JPM7: HSP82 and MZ)J7; ifSP52 and MDJ2; 

HSP82 and MP737; /ZSPS2 and MPD2; HSP82 and PD/7; JYSPS2 and PFDl; 

HSP82 and ^5C7; 77SPS2 and APJ1; HSP82 and ^7T77; HSP82 and ^TP72; 

HSPS2 and BTT1; HSP82 and CDC37; HSP82 and CPP7; 77SP52 and 7YSC52; 

HSP82 and i£47f2; /7SPS2 and LHS.1; HSP82 and MG£7; 7/5P82 and MRS11; 

HSP82 and 7VOP7; HSP82 and PCM70; HSPS2 and SSA1: HSP82 and £S,42; 

H5P52 and SSA3; HSP82 and HSP82 and 55C7; /75PS2 and SSE2: HSP82 

and S7I7; 7/5P52 and SLS1; HSP82 and OPM7; HSP82 and ORM2-, HSP82 and 

P£7?7; #S/>£2 and PTC2; 775P52 and PSE1: HSP82 and £73/4; 77SPS2 and 7/4C7 

or a truncated intronless HACJ: JEM1 and M73J7: J£M7 and MDJ2; JEM1 and 

MPD1; JEM1 and MPD2; JEM1 and PD/7; JPM7 and PFDl; JEM1 and ^PC7; 

J£M7 and APJ1; JEM1 and ^7T77; J£M7 and ATP 12; JEM] and 75m; JEM1 

and CDC57; JEM7 and CPR7; JEM1 and 775C52; JPM7 and KAR2; JEM1 and 

I//57; JEM1 and MGE1; JEM1 and MRSH; JEM1 and 7^0727; J£M7 and 

ECM10; JEM1 and 5*5^7; JEM1 and 55,42; J£M7 and SSA3; JEM1 and 

J£M7 and SSC1; JEM1 and SS£2; JEM1 and 5/17; JPM7 and SLS1; JEM1 and 

OPM7; .7£M7 and ORM2; J£M7 and PERI; JEM1 and PTC2; JEM1 and PSE1; 

JEM1 and £75/4; 2£M7 and /£4C7 or a truncated intronless HAC1; MDJ1 and 

MDJ2; MDJ1 and MPD1; MDJ1 and MPD2; MDJ1 and PD/i; MA/7 and PPD7; 

MA77 and ylPC7; MDJ7 and APJ1; MDJ1 and ^/Pii; MDJ1 and ^/P72; MDJ1 

and P7T7; MD./7 and CDC3 7; M/X77 and CPP7; MDJ1 and 7/5(252; MDJ1 and 

7^4P2; MA/7 and ZJ/57; MDJ1 and MGP7; MDJ1 and MP577; MD./7 and 7/OB7; 

MM and £CM70; MDJ1 and 5547; M72.77 and 5542; MDJ1 and SSA3; MDJ1 

and 5544; MDJ1 and 55C7; MA/7 and SSE2; MDJ1 and 57X7; MA/7 and 5AS7; 

MM and 07?M7; MA/7 and OPM2; MA77 and PERI: MDJ1 and PTC 2: MDJ1 
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and PSE1: MDJ1 and UB14; MDJ1 and HAC1 or a truncated intronless HAC1; 

MDJ2 and MPD1; MDJ2 and MPD2; MDJ2 and PD11; MDJ2 and PFD1; MDJ2 

and ABC1; MDJ2 and APJ/; MDJ2 and ,4PP//; MDJ2 and ,4PP/2; MDJ2 and 

P7T/; MDJ2 and CDC37; MDJ2 and CP/? 7; M/3J2 and //5G52; MDJ2 and /C4P2; 

M)J2 and Pf/5/; M/3./2 and MGP/; MPa/2 and M/?5//; M73J2 and NOB1; MDJ2 

and ECMJO: MDJ2 and 554/; MDJ2 and 5542; MD.J2 and 55A?; MDJ2 and 

55,44; MDJ2 and 55C/; MDJ2 and 55P2; M£./2 and 5/17; MDJ2 and 5/57; 

MDJ2 and ORM1; MDJ2 and ORM2;MDJ2 and PPP/; MDJ2 and P7C2; M/5J2 

and P5P/; MDJ2 and CS/4; MZX72 and 7/4 C/ or a truncated intronless 7/4 C/; 

MPD/ and MPD2; MPD1 and P/5//; MPD/ and PFD1; MPD1 and ABC/ ; MP/57 

and ,4P//; MP/57 and ATP 11; MPD1 and ^7T72; MPD1 and 5/77; MPD1 and 

CDC 3 7; MPD1 and CP/? 7; MP/3/ and 7/5GS2; MPP>7 and /C4P2; MPD1 and 

LHS1; MPD1 and MGP7; MPD1 and MP5/7; MP/5/ and A'OP/; MPD1 and 

PCM/0; MPD1 and 55,4/; MPD1 and 5542; MPD1 and 55,45; MPD1 and 5X44; 

MPD1 and 55C/; MPD1 and 55P2; MP/>/ and 57/7; MP/3/ and 5/57; MPD1 and 

6>/?M7; MPD1 and OPM2; MPD1 and PP/?/; MPD1 and P/C2; MPD1 and P5P/ ; 

MP/37 and £72/4; MP/)/ and HAC1 or a truncated intronless 7Z4C/; MP/)2 and 

P/3//; MP/52 and PFD1; MPD2 and ,4PC/; MP/>2 and ^PJ/; MPD2 and ,4PP/ / ; 

MP/32 and ,4PP/2; MP/32 and PPP/; MPD2 and CDC3 7; MP/32 and CPP7; 

MPD2 and //5C52; MP/)2 and /C4/?2; MP/32 and ///5/; MP/52 and MGE1; 

MPD2 and MRS 11; MPD2 and A'OP/; MPD2 and PCM/0; MP/52 and 554/; 

MP/32 and 55,42; MP/52 and 5545; MPD2 and 55,44; MP/52 and 55C/; MPD2 

and 55P2; MP/52 and SILT, MPD2 and 5/5/; MPD2 and CPM/; MPD2 and 

<3/?M2; MP/52 and PPP/; MP/32 and PTC2; MPD2 and PSE1; MPD2 and CPM; 

MP/32 and HAC1 or a truncated intronless HAC1; PDI1 and PP/3/; P/»// and 

ABC1; PD11 and ,4PJ/; PD11 and ,4PP//; P/5// and ATP12; PDI1 and PPP/; 

P/5// and CDC37; PDI1 and CP/? 7; P/5// and //5CS2; P/)// and KAR2; PD11 and 

///5/; PP>// and MGE1: PD11 and MRS 11; PDI1 and A'OP/; PD// and ECM10; 

PD11 and 55,4/; P/3// and SSA2; PDI1 and 5545; PDI1 and 5544; P/)// and 

SSC1; PD11 and 55P2; P/3// and SIL1; PD11 and 5/5/; P/5// and CPM/; P/5// 

and OR1M2; PD11 and PERI; PD11 and P/C2; P/3// and PSE1; PDI1 and CPM; 
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PDJJ and HACJ or a truncated intronless HACJ; PFDJ and ABC J: PFDJ and 
AP JJ; PFD1 and ATPJJ: PFD1 and .47P72; PPD7 and 5777; PPD7 and CDC3 7; 
PFD1 and CPR7; PFD1 and 775CS2: PF727 and 7C4P2: PFD1 and 1/757; PFD1 
and MGEJ; PFD1 and MP577; PP/>7 and 7VO&7; PFD1 and ECMJO; PFD1 and 

5 5547; PPD7 and 5542; PFP>7 and 55.43; PFD1 and 5544; PFD1 and 55C7; 
PFZ>7 and 55P2; PF737 and 5/17; PFD1 and 5157; PFD1 and 0PM7; PP737 and 
ORM2; PFD1 and PPP7: PFD1 and PPC2; PFD1 and P5£7: PFD1 and £75/4; 
PPD7 and 774 C7 or a truncated intronless HA CP, ABC1 and AP/7; 473C7 and 
ATP1P, ABCJ and 4.7P72; ABCJ and BTTJ; ABCJ and CDC37; ABC1 and CPR7; 

10 ^PC7 and /75CS2; ,4PC7 and KAR2; ABCJ and L7757; .4PC7 and MGEJ; ABCJ 
and M7J577; ABC1 and 7V057; ^73C7 and ECM1 0; ABCJ and 5547; .4PC7 and 
5542; ^PC7 and 5543; ABCJ and 5544; .4PC7 and 55C7; ^SC7 and SSE2; ABCJ 
and SJLJ-ABCJ and 5L57; ABCJ and ORMJ; ABCJ and ORM2; ABCJ and PPP7; 
.4/3C7 and PTC2; ABCJ and P5P7; ABCJ and £773/4; .473C7 and 77.4 C7 or a 

15 truncated intronless HACJ; ^PJ7 and .4PP77; .4PJ7 and ^PP72; APJJ and BTTJ: 
APJ1 and CDC37; APJJ and CPR7; APJJ and //5GS2; APJJ and 7C4P2; .4P77 and 
LHSJ; APJJ and MGEJ ; -4P/7 and M72577; .4P/7 and NOB J; APJJ and ECMJO; 
APJJ and 5547; y4P/7 and 5542; ^PJ7 and 55^3; ^PJ7 and 5544; APJJ and 
55C7; APJJ and SSE2; APJJ and 5/17; APJJ and 5L57; APJJ and ORMJ: APJJ 

20 and 07?M2; v4PJ7 and P£P7; APJJ and PPC2; ^PJ7 and P5P7; ^PJ7 and UBI4; 
APJJ and 774 C7 or a truncated intronless HACJ; ATPJJ and ^PP72; ^PP77 and 
BTTJ; ATPJJ and CDC37; ATPJJ and CPR7; ATPJJ and HSC82; ATPJJ and 
KAR2; ^PP77 and. LHSJ; ATPJJ and MGEJ; ATPJJ and MP577; ^PP77 and 
NOB J; ATPJJ and ECMJO; ATPJJ and 5547; ^7P77 and 5542; ^PP77 and 

25 SSA3; ATPJJ and 5544; ^PP77 and 55C7; ^PP77 and SSE2; ATPJJ and 5/Z7; 
^PP77 and SLSJ; ATPJJ and ORMJ; ATPJJ and ORM2; ATPJJ and PER J; 
ATPJJ and PTC2; ATPJJ and P5P7; .47P77 and UBI4; ATPJJ and HACJ or a 
truncated intronless HACJ; ^PP72 and BTTJ; ATPJ2 and CDC37; ATP12 and 
CPR7; ATPJ2 and 775CS2; 47P72 and KAR2; ATPJ2 and 7.775/; ^7P72 and 

30 MGEJ; ATPJ2 and 717P577; ^PP72 and A'OB7; 47P72 and ECMJO; ATP J 2 and 

55^7: 47P72 and 5542; .47P72 and SSA3; ATPJ2 and 5544; ^PP72 and 55C7; 
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ATP12 and SSE2: ATP12 and SILJ: ATP12 and SLS1: ATP 12 and ORM1: ATP12 

and ORM2; ATP12 and PERI: ATP 12 and PTC2; ATP12 and PSE1; ATP12 and 

£77374; ATP 12 and 7X4C7 or a truncated intronless HAC1; BTT1 and CDC 3 7; BTT1 

and CP7?7; 57T7 and HSC82; BTT1 and 7£47?2; BTT1 and L//57; 737T7 and 

5 MGET, BTT1 and M7?577; 737T7 and NOB1; BTT1 and ECM10; BTT1 and 5547 ; 

BTT1 and 5542; B7T7 and 5543; BTT1 and 55^^; 75m and SSC1: BTT1 and 

55£2; 737T7 and 57X7; 737T7 and SLS1: BTT1 and ORMT, BTT1 and ORM2; BTT1 

and P£7?7; 737T7 and PTC2; BTT1 and PSET, BTT1 and 1/574; 737T7 and- HAC J or 

a truncated intronless HAC1: CDC37 and C7>7?7; CDC 3 7 and 7/5C52; CDC3 7 and 

l o 7£47?2; C7)C3 7 and LHS1 ; C£>C3 7 and MGE1 ; CDC3 7 and MR57 7 ; C7>C3 7 and 

AfOB7; CDC37 and 7±CM70; CDC37 and 55.47; COC37 and 5542; C7XT37 and 

S&43; CDC37 and 5544; CDC 3 7 and 55C7; CDC37 and 55£2; CDC37 and 57Z7; 

CZ)C37 and 5157; CDC 37 and ORM7; CDC3 7 and ORM2; C7)C57 and 7>£7?7; 

C7>C37 and PTC2; CDC37 and 7>5£7; C7XT37 and UBI4; CDC37 and 7X4C7 or a 

15 truncated intronless HAC1; CP7?7 and 775CS2; CPR 7 and 7£47?2; CP7?7 and L7757 ; 

CPR7 and MGET, CPR7 and MK577; CP7?7 and NOB1: CPR7 and £CM70; CP7?7 

and 55.47; CPR7 and 55^2; CPR7 and 5543; CPR7 and 55.44; CP7?7 and 55C7; 

CP7?7 and SSE2; CPR7 and 57Z7; C7>7?7 and 5Z57; CP7?7 and ORM1; CPR7 and 

ORM2; CPR7 and PERI; CPR7 and PTC2; CPR7 and 7>5£7; CP7?7 and UBI4; 

20 CP7?7 and 774C7 or a truncated intronless 7X4C7; 775CS2 and KAR2; HSC82 and 

L7757; 775CS2 and MGEl; HSC82 and M7?577; 775CS2 and NOB1; HSC82 and 

ECM10; HSC82 and 5547; HSC82 and 5542; 775C52 and SSA3; HSC82 and 

5544; HSC82 and 55C7; HSC82 and SSE2; HSC82 and 577,7; 775C52 and 57,57; 

775C52 and ORM1; HSC82 and ORM2; HSC '8 '2 and PERT, 775C52 and PTC2; 

25 HSC82 and P57J7; 745C52 and UB14; HSC82 and HAC 1 or a truncated intronless 

HA CI; KAR2 and 7,7757; 7C4P2 and MGEl; KAR2 and MRS 11; KAR2 and NOB1; 

KAR2 and ECM10; KAR2 and 5547; 7C47?2 and SSA2: KAR2 and SSA3; KAR2 and 

SSA4; KAR2 and 55C7; 7647*2 and 55£2; KAR2 and 57L7; 7C47?2 and 57,57; KAR2 

and ORM1; KAR2 and ORM2; KAR2 and PERI; KAR2 and PTC2; KAR2 and 

30 PSEl; KAR2 and UBI4; KAR2 and 774C7 or a truncated intronless HAC1; LHS1 

and MGEl; 7,7757 and MR577; LHSJ and NOB1; L7757 and ECM10; LHS1 and 
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SSAJ; LHSJ and SSA2; LHSJ and SSA3; LHSJ and SSA4; LHSJ and SSC1; LHSJ 

and 55752; LHS1 and SILJ: LHSJ and SLSJ: LHSJ and ORMJ: LHSJ and ORM2: 

LHSJ and 7>£7G; 7.7757 and PTC2; LHSJ and 7>5£7; LHSJ and G574; L7/57 and 

H4CJ or a truncated intronless HACJ; MGEJ and M7?577; MGE1 and /VOB7; 

5 MGEJ and 75CM70; MG£7 and 5547; 7MGE7 and SSA2; MGEJ and 5543; MGEJ 

and 5544; 7\4G£7 and SSCJ; MGEJ and 55£2; MGE7 and 5717; MGEJ and 57,57; 

jl4G£7 and OJUH; MGEJ and ORM2; MGEJ and ££7?7; MG£7 and PTC2; 

MGEJ and P5£7; MG£7 and UBJ4: MGEJ and 7£4C7 or a truncated intronless 

HACJ: MRS J J and NOB J; MRS J J and ECMJO; MRS J J arid 5547; M7?577 and 

10 SSA2; MRSJ J and SSA3; MRSJJ and 55^4; MRSJJ and SSCJ; MRSJJ and 55£2; 

7147?577 and 577,7; M7?577 and 57.57; M7?577 and ORMJ; M7?577 and ORM2; 

MRSJJ and 7>£7G; M7?577 and PTC2; MRSJJ and PSEJ; MRSJJ and G7374; 

MK57 7 and HACJ or a truncated intronless 7i4C7; 7V0737 and ECMJO: NOB J and 

5547; NOB1 and 5542; 7^957 and 5545; NOBJ and 5544; 7VOB7 and 55C7; 

15 NOBJ and SSE2; NOBJ and 577.7; A'OB7 and 5Z57; NOBJ and ORMJ; NOBJ and 

G7?M2; A'G737 and 7>£R7; NOBJ and £7X2; NOBJ and £5£7; AT0737 and 7/7374; 

NOBJ and HACJ or a truncated intronless 774C7; ECMJO and 5547; ECMJO and 

5542; £GM70 and SSA3; ECMJO and 5544; ECMJO and 55C7; £CM70 and 

SSE2; ECMJO and 5717; £CM70 and 5157; £CM70 and ORMJ; £CM70 and 

20 ORM2: ECMJO and ££7?7; ECMJO and PTC2: ECMJO and £5£7; ECMJO and 

UBI4: ECMJO and 724C7 or a truncated intronless HACJ: SSAJ and 5542; 5547 

and 5543; 5547 and 5544; 5547 and 55C7; 5547 and SSE2: SSAJ and 57Z7; 5547 

and 5L57; 5547 and ORMJ; SSAJ and ORM2; SSAJ and PERJ; SSAJ and 7>7G2; 

5547 and £5£7; 5547 and 77574; 5547 and HACJ or a truncated intronless 774C7; 

25 SSA2 and SSA3; SSA2 and 5544; SSA2 and 55C7 ; SSA2 and 55£2; SSA2 and 57Z7 ; 

5542 and 5Z57; SSA2 and OTU47; SSA2 and 0£M2; 5542 and £££7; SSA2 and 

£7G2; 5542 and £5£7; SSA2 and G574; 5542 and ££4C7 or a truncated intronless 

HACJ; SSA3 and 5544; 5543 and 55C7; SSA3 and SSE2; SSA3 and 57X7; 5543 

and 5157; 5543 and G7?M7; SSA3 and 0£/\//2; 5545 and PERJ; SSA3 and 7TC2; 

30 SSA3 and £5£7; 5545 and UBJ4; SSA3 and 774C7 or a truncated intronless 774C7; 

5544 and 55C7; 5544 and SSE2; SSA4 and 5717; 5544 and 5Z57; 5544 and 
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ORMJ; SSA4 and ORM2; SSA4 and PERI: SSA4 and PTC2: SSA4 and £5£7; 

SSA4 and UBJ4: SSA4 and HAC1 or a truncated intronless HACJ: SSCJ and 55£2; 

SSCJ and 5777; 55C7 and 5757; SSCJ and 0£J77; 55C7 and ORM2; SSCJ and 

£££7; SSC1 and /TC2; 55C7 and £5£7; SSCJ and [7£74; 55C7 and HAC1 or a 

truncated intronless HACJ; SSE2 and 5/17; SSE2 and 5757; 55£2 and ORM1; 

SSE2 and ORM2; 55£2 and £££7; SSE2 and ££C2; SSE2 and £5£7; 55£2 and 

UBI4; SSE2 and 774 C7 or a truncated intronless HACJ; SIL1 and 5X57; £717 and 

<9£M7; 5717 and 07?M2; 5717 and £££7; 5777 and PTC2; SIL1 and £5£7; 5777 

and JJBI4; SIL1 and 774C7 or a truncated intronless 774 C7; SLS1 and ORMJ: SLS1 

and ORM2; SLS1 and £££7; 5157 and PTC2; SLS1 and PSE1; SLS1 and UBJ4; 

SLS1 and 77/1C7 or a truncated intronless 774C7; OKhdl and ORM2; ORM1 and 

£££7; 0£M7 and PTC2; ORMJ and £5£7; <9£M7 and UBI4; ORMJ and 774 C7 

or a truncated intronless HACJ; ORM2 and £££7; 0£M2 and PTC2; ORM2 and 

£5£7; ORM2 and L£74; ORM2 and HACJ or a truncated intronless HACJ;PERJ 

and £7U2; £££7 and £5£7; PER J and ££7¥; £££7 and 774 C7 or a truncated 

intronless 774C7; PTC2 and £5£7; PTC2 and 17574; ££C2 and HACJ or a 

truncated intronless 774C7; PSEJ and L7£W; £5£7 and 774C7 or a truncated 

intronless HACJ: UBJ4 and HACJ or a truncated intronless 774C7; TIM9 and 

,47747/ £7MP and CCT2; TIM9 and COT; 77MP and CCT4; TIM9 and CCT5; 

TIM9 and CCT6; 77MP and CCT7; TIM9 and CCT5; £7MP and CNSJ; TIM9 and 

C££3; TIM9 and C££(); 7YM9 and £7*07; TJM9 and EUGJ; TIM9 and £M07; 

TIM9 and 77C777; TIM9 and 775£70; £7MP and HSPJ2; TIM9 and 775£704; £7MP 

and HSP26\ TIM9 and 775£50; 77MP and 775£42; TIM9 and HSP60; TIM9 and 

775£7S; £7MP and 775£S2; £7MP and ,7£M7; TIM9 and MDJ1; TIM9 and MDJ2; 

TIM9 and 717£7>7; T7MP and MPD2; TIM9 and £7)77; £7MP and £££7; TIM9 and 

^5C7; £7MP and ^£J7; TIM9 and ATP 11; TIM9 and ,4££72; £7MP and £££7; 

£7MP and CDC37; TIM9 and C£7?7; 77MP and 775C52; £7MP and KAR2; TIM9 

and 77757; 77MP and MG£7; £7MP and M£577; TIM9 and NOB J; TIM9 and 

ECM10; TIM9 and 55,47; TJM9 and SSA2; TJM9 and SSA3; TIM9 and SSA4; TIM9 

and 55C7; TIM9 and 55£2; £7A7P and 5777; £7MP and 5757; £7MP and <9£M7; 

TIM9 and ORM2; TIM9 and £££7; TIM9 and PTC2; TIM9 and £5£7; TJM9 and 
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UB14: TIM9 and HAC1 or a truncated intronless HACJ; P AMI 8 and AHA1; 
PAM18 and CCT2; PAM18 and CCT3: PAM18 and CCT4\ PAM18 and CCT5; 
PAM18 and CCT6: PAMJ8 and CCT7; PAM18 and CCPS; P^M7S and CNS1; 
PAMJ8 and CPPJ; and CPR6: PAM18 and PP07; PAM18 and PC7G7; 

5 P.4M7S and FMOl ; P.4M7S and 77C777; P.4M7S and 77SP70: P.4M7S and 7PSP72; 
PAM18 and HSPJ04; PAM18 and 77SP26; 7 > ^M75 and HSP30: P AMI 8 and 
HSP42: PAM18 and T/SPtfO; P.4M7S and HSP78; PAM18 and 775PS2: PAM18 
and J£M7; P.47WS and MDJ1; PAM18 and MA72; P^M75 and MP737: P,4M/S 
and MPD2; PAM18 and PP>77; P.4M7S and PFD1- PAM18 and ^75C7; P.4M/S 

10 and APJ]\ PAM18 and .4PP77; PAM18 and .4PP72; P4M7S and BTTJ; PAM18 
and CDC 37; PAM18 and CPR7: PAM18 and 77SCS2: 7 } ^M7«S and 7C4P2; P.4M7S 
and LHS1; PAM18 and MOE7; ^7^75 and MPS77; PAM18 and 7^9737; PAM18 
and ECM10; PAM18 and S&47; P.4M7S and 55^2; PAM18 and S&43; P^!M7S and 
SSA4; PAM18 and 5'5C7: P/1M7S and SSE2; PAM18 and S7P7: PAM18 and SXS7; 

15 PAM7S and ORMh PAM18 and ORM2; PAM18 and P£P7; P^M7S and PTC2; 
PAM18 and PS£7; P^M75 and UBI4; PAM18 and 7£4C7 or a truncated intronless 
i£4C7; PCP7 and AHA1; TCP1 and CCT2- TCP1 and CCT3: TCP1 and CCT4; 
TCP1 and CCT5; TCP J and CCT6; TCP! and CCP7; PCP7 and CCT8: TCP! and 
CNSJ; TCP1 and CPP3; PCP7 and CPP<5; PCP7 and PP<97; PCP7 and ££7G7; 

20 PCP7 and FMG7; 7XP7 and HCHJ; TCP1 and HSP10; TCP1 and 77SP72; TCP1 
and #SP704; PCP7 and HSP26: PCP7 and 77SP30; PCP7 and 775P42; PCP7 and 
HSP60; TCP1 and HSP78; TCP] and 775P52; PCP7 and JEM 7; TCP J and MDJT, 
TCP] and MDJ2; TCP] and MPD7; PCP7 and MPD2; TCP] and PDI1; PCP7 
and PFP>7; PCP7 and ABC]; PCP7 and ^PJ7; TCP 1 and ^PP77; PCP7 and 

25 ^PP72; PCP7 and BTT]\ TCP] and CDC37; TCP] and CPR7; TCP] and 77SGS2; 
PCP7 and KAR2; TCP] and LHSJ; TCP] and MGEJ; TCP] and MP577; TCP] 
and A'G737; TCP 7 and ECM10; TCP] and SSAJ; TCP] and SSA2; TCP] and 
SSA3; TCP] and SSA4; TCP] and 55C7; PCP7 and SSE2; TCP] and 57P7; PCP7 
and SLS]; TCP] and OKM7; PCP7 and ORM2: TCP] and P£7?7; TCP 7 and 

30 PTC 2; TCP] and PSE7; 7GP7 and f7/37f; PCP7 and 7£4C7 or a truncated 

intronless 7£4C7; TIM9 and P^tM7S; P7M 9 and TCP] ; or PAM18 and TCP1. 
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The first, second and third recombinant genes may, or may not, each individually 
be present on a plasmid within the host cell (which may, or may not, be a 2um- 
family plasmid, as discussed above) or be chromosomally integrated within the 
genome of the host cell. It will be appreciated that any combination of plasmid 
and chromosomally integrated first, second and third recombinant genes may be 
Used. For example, the first, second and third recombinant genes may, or may not, 
each individually be present on a plasmid, and this may, or may not, be either the. 
same plasmid or different plasmids. Alternatively, the first recombinant gene may, 
or may not, be present on a plasmid, and second and third recombinant genes may, 
or may not, be chromosomally integrated within the genome of the host cell. 
Alternatively, the first and second recombinant genes may, or may not, be present 
on a plasmid and the third recombinant gene may, or may not, be chromosomally 
integrated within the genome of the host cell. Alternatively, the first and third 
recombinant genes may, or may not, be present on a plasmid and the second 
recombinant gene may, or may not, be chromosomally integrated within the 
genome of the host cell. Alternatively, the first and second recombinant gene 
may, or may not, be chromosomally integrated within the genome of the host cell 
and the third recombinant gene may, or may not, be present on a plasmid. 
Alternatively, the first, second and third recombinant genes may, or may not, each 
individually be chromosomally integrated within the genome of the host cell. 

Plasmids used for this purpose may, or may not, be plasmids, such as 2um-family 
plasmids, as defined below. Thus, in one embodiment, a method according to the 
first aspect of the invention does not involve a host cell in which the first, second 
and third recombinant genes are all present on the 2um-family plasmid. 

Accordingly, as a second aspect, the present invention also provides a plasmid 
wherein the plasmid comprises two different genes (the first and second 
recombinant genes) encoding different chaperones. In one preferred embodiment, 
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the plasmid may, or may not, further comprise a gene encoding a heterologous 
protein (the third recombinant gene), such as a heterologous protein as described 
above. A plasmid according to the second aspect of the invention may, or may 
not, be a 2um-family plasmid. 

A third aspect of the present invention provides for the use of the plasmid of the 
second aspect of the invention as an expression vector to increase the production 
of a desired protein, including as heterologous protein, such as a fungal 
(optionally yeast) or vertebrate protein. The desired protein may, or may not, be 
encoded by a recombinant gene that is present as part of the plasmid, . or present in 
the host cell on a different plasmid, or present in the host cell as a transgene that is 
integrated in the host cell's chromosome. 

A fourth aspect of the invention provides a host cell comprising a plasmid as 
defined above. The host cell may, or may not, further comprise a recombinant 
gene encoding a desired heterologous protein. Where the recombinant gene that 
encodes the desired heterologous protein (the "third recombinant gene") is not 
present as part of the same plasmid that encodes the first and second chaperones, 
then the host cell may, or may, not, comprise the third recombinant gene on a 
different plasmid, or as a transgene that is integrated in the host cell's 
chromosome. 

As a fifth aspect, the present invention provides a host cell which comprises the 
first, second and third recombinant genes. The first, second and third recombinant 
genes may, or may not, each individually be present on a plasmid within the host 
cell (which may, or may not, be a 2um-family plasmid, as discussed above) or be 
chromosomally integrated within the genome of the host cell. It will be 
appreciated that any combination of plasmid and chromosomally integrated first, 
second and third recombinant genes may be used, as discussed, above. Thus, the 
host cell may, or may not, comprise the first, second and third recombinant genes 
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each individually present on a plasmid, and this may, or may not, be either the 
same plasmid or different plasmids. Alternatively, the host cell may, or may not, 
comprise the first recombinant gene on a plasmid, and second and third 
recombinant genes chromosomally integrated within the genome of the host cell. 
Alternatively, the host cell may, or may not, comprise the first and second 
recombinant genes on a plasmid and the third recombinant gene chromosomally 
integrated within the genome of the host cell. Alternatively, the host cell may, or 
may not, comprise the first and third recombinant genes on a plasmid and the 
second recombinant gene chromosomally integrated within the genome of the host 
cell. Alternatively, the host cell may, or may not, comprise the first and second 
recombinant genes chromosomally integrated within the genome of the host cell 
and the third recombinant gene present on a plasmid. Alternatively, the host cell 
may, or may not, comprise the first, second and third recombinant genes each 
individually chromosomally integrated within the genome of the host cell. 

The 2um-familv plasmids: 

For the purposes of the present invention, a plasmid may, or may not, be a 2 urn- 
family plasmid. Certain closely related species of budding yeast have been shown 
to contain naturally occurring circular double stranded DNA plasmids. These 
plasmids, collectively termed 2um-family plasmids, include pSRl, pSB3 and 
pSB4 from Zygosaccharomyces rouxii (formerly classified as Zygosaccharomyces 
bisporus), plasmids pSBl and pSB2 from Zygosaccharomyces bailii, plasmid 
pSMl from Zygosaccharomyces fermentati, plasmid pKDl from Kluyveromyces 
drosphilarum, an un-named plasmid from Pichia membranaefaciens (hereinafter 
"pPMl") and the 2pm plasmid (such as shown in Figure 1) and variants (such as 
Scpl, Scp2 and Scp3) from Saccharomyces cerevisiae (Volkert, et ah, 1989, 
Microbiological Reviews, 53, 299; Murray et ah, 1988, X Mol. Biol. 200, 601; 
Painting, et ah, 1984, J. Applied Bacteriology, 56, 331). 
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As a family of plasmids these molecules share a series of common features in that 
they typically possess two inverted repeats on opposite sides of the plasmid, have 
a similar size around 6-kbp (range 4757 to 6615-bp), three open reading frames,, 
one of which encodes for a site specific recombinase {FLP) and an autonomously 
replicating sequence (ARS), also known as an origin of replication (ori), located 
close to the end of one of the inverted repeats. (Futcher, 1988, Yeast, 4, 27; 
Murray et a!., op. cit., and Toh-e et al, 1986, Basic Life Sci. 40, 425). Despite 
their lack of discernible DNA sequence homology, their shared molecular 
architecture and the conservation of function of the three open reading frames 
have demonstrated a common ancestral link between the family members. 

The above naturally occurring 2um-family plasmids may, or may not, be used in 
the present invention, but this invention is not limited to the use of naturally 
occurring 2um-family plasmids. For the purposes of this invention, a 2um-family 
plasmid may, or may not, be as described below. 

A 2um-family plasmid is a circular, double stranded, DNA plasmid. It is typically 
small, such as between 3,000 to 10,000 bp, optionally between 4,500 to 7000 bp, 
excluding recombinantly inserted sequences. 

A 2um-family plasmid typically comprises at least three open reading frames 
("ORFs") that each encodes a protein that functions in the stable maintenance of 
the 2um-family plasmid as a multicopy plasmid. The proteins encoded by the 
three ORFs can be designated FLP, REP1 and REP2. Where a 2um-family 
plasmid comprises not all three of the ORFs encoding FLP, REP1 and REP2 then 
ORFs encoding the missing protein(s) should be supplied in trans, either on 
another plasmid or by chromosomal integration. 

A "FLP" protein is a protein capable of catalysing the site-specific recombination 
between inverted repeat sequences recognised by. FLP. The inverted repeat 
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sequences are termed FLP recombination target (FRT) sites and each is typically 
present as part of a larger inverted repeat (see below). Preferred FLP proteins 
comprise the sequence of the FLP proteins encoded by one of plasmids pSRl, 
pSBl, pSB2, pSB3, pSB4, pSMl, pKDl, pPMl and the 2pm plasmid, for 
example as described in Volkert et al, op. cit, Murray et al, op. ext., and Painting 
el al, pp. cit. Variants and fragments of these FLP proteins are also included in 
the present invention. "Fragments" and "variants" are those which retain the ability 
of the native protein to catalyse the site-specific recombination between the same 
FRT sequences. Such variants and fragments will usually have at least 50%, 60%, 
70%, 80%, 90%, 95%, 98%, 99%, or more, homology with an FLP protein encoded 
by one of plasmids pSRl, pSBl, pSB2, pSB3, pSB4, pSMl, pKDl, pPMl and the 
2pm plasmid. Different FLP proteins can have different FRT sequence 
specificities. A typical FRT site may, or may not, comprise a core nucleotide 
sequence flanked by inverted repeat sequences. In the 2pm plasmid, the FRT core 
sequence is 8 nucleotides in length and the flanking inverted repeat sequences are 
13 nucleotides in length (Volkert et al, op. cit.). However the FRT site recognised 
by any given FLP protein may, or may not, be different to the 2pm plasmid FRT 
site. 

REP1 and REP2 are proteins involved in the partitioning of plasmid copies during 
cell division, and may, or may not, also have a role in the regulation of FLP 
expression. Considerable sequence divergence has been observed between REP1 
proteins from different 2pm-family plasmids, whereas no sequence alignment is 
possible between REP2 proteins derived from different 2pm-family plasmids. 
Preferred REP 1 and REP2 proteins comprise the sequence of the REP1 and REP 2 
proteins encoded by one of plasmids pSRl, pSBl, pSB2, pSB3, pSB4, pSMl, 
pKDl, pPMl and the 2pm plasmid, for example as described in Volkert et al, op. 
cit., Murray et al, op. cit., and Painting et al, op. cit. Variants and fragments of 
these REP J and REP2 proteins are also included in the present invention. 
"Fragments" and "variants" of REP 1 and REP2 are those which, when encoded by 
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the plasmid in place of the native ORF, do not substantially disrupt the stable 
multicopy maintenance of the plasmid within a suitable yeast population. Such 
variants and fragments of REP 1 and REP 2 will usually have at least 5%, 10%, 20%, 
30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology with a 
5 REP1 and REP 2 protein, respectively, as encoded by one of plasmids pSRl, 
pSBl, pSB2, P SB3,pSB4, pSMl, pKDl, pPMl and the 2um plasmid. 

The REP1 and REP2 proteins encoded by the ORFs on the plasmid must be 
compatible. It is preferred that the REP J and REP2 proteins have the sequences 
10 of REP 1 and REP2 proteins encoded by the same naturally occurring 2um-family 
plasmid, such as pSRl, pSBl, P SB2, P SB3, P SB4, pSMl, pKDl, pPMl and the 
2um plasmid, or variant or fragments thereof. 

A 2um-family plasmid typically comprises two inverted repeat sequences. The 
15 inverted repeats may be any size, so long as they each contain an FRT site (see 
above). The inverted repeats are typically highly homologous. They may, or may 
not, share greater than 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 
99.5% or more sequence identity. In a preferred embodiment they are identical. 
Typically the inverted repeats are each between 200 to 1000 bp in length. 
20 Preferred inverted repeat sequences may, or may not, each have a length of from 
200 to 300 bp, 300 to 400 bp, 400 to 500 bp, 500 to 600 bp, 600 to 700 bp, 700 to 
800 bp, 800 to 900 bp, or 900 to 1000 bp. Particularly preferred inverted repeats 
are those of the plasmids pSRl (959 bp), pSBl (675 bp), pSB2 (477 bp), pSB3 
(391 bp), pSMl (352 bp), pKDl (346 bp), the 2um plasmid (599 bp), pSB4 or 
25 pPMl. 

The sequences of the inverted repeats may,, or may not, be varied. However, the 
sequences of the FRT site in each inverted repeat should be compatible with the 
specificity of the FLP protein encoded by the plasmid, thereby to enable the 
30 encoded FLP protein to act to catalyse the site-specific recombination between the 
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inverted repeat sequences of the plasmid. Recombination between inverted repeat 
sequences (and thus the ability of the FLP protein to recognise the FRT sites with 
the plasmid) can be determined by methods known in the art. For example, a 
plasmid in a yeast cell under conditions that favour FLP expression can be 

5 assayed for changes in the restriction profile of the plasmid which would result 
from a change in the orientation of a region of the plasmid relative to another 
region of the plasmid. The detection of changes in restriction profile indicate that 
the FLP protein is able to recognise the FRT sites in the plasmid and therefore that 
the FRT site in each inverted repeat are compatible with the specificity of the FLP 

1 o protein encoded by the plasmid. 

In a particularly preferred embodiment the sequences of inverted repeats, 
including the FRT sites, are derived from the same 2|am-family plasmid as the 
ORF encoding the FLP protein, such as pSRl, pSBL pSB2, pSB3, pSB4, pSMl, 
15 pKDl, pPMl or the 2^im plasmid. 

The inverted repeats are typically positioned with the 2jxm-family plasmid such 
that the two regions defined between the inverted repeats (e.g. such as defined as 
UL and US in the 2^m plasmid) are of approximately similar size, excluding 
20 exogenously introduced sequences such as transgenes. For example, one of the 
two regions may, or may not, have a length equivalent to at least 40%, 50%, 60%, 
70%, 80%, 90%, 95% or more, up to 100%, of the length of the other region. 

A 2jam-family plasmid typically comprises the ORF that encodes FLP and one 
25 inverted repeat (arbitrarily termed "IR1" to distinguish it from the other inverted 
repeat mentioned in the next paragraph) juxtaposed in such a manner that IR1 
occurs at the distal end of the FLP ORF, without any intervening coding sequence, 
for example as seen in the 2^im plasmid. By "distal end" in this context we mean 
the end of the FLP ORF opposite to the end from which the promoter initiates its 
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transcription. In a preferred embodiment, the distal end of the FLP ORF overlaps 
with IRl. 

A 2um-family plasmid typically comprises the ORF that encodes REP2 and the 
5 other inverted repeat (arbitrarily termed "IR2" to distinguish it from IRl 
mentioned in the previous paragraph) juxtaposed in such a manner that IR2 occurs 
at the distal end of the REP 2 ORF, without any intervening coding sequence, for 
example as seen in the 2um plasmid. By "distal end" in this context we mean the 
end of the REP 2 ORF opposite to the end from which the promoter initiates its 
10 transcription. 

In one embodiment, the ORFs encoding REP2 and FLP may, or may not, be 
present on the same region of the two regions defined between the inverted repeats 
of the 2um-family plasmid, which region may be the bigger or smaller of the 
15 regions (if there is any inequality in size between the two regions). 

In one embodiment, the ORFs encoding REP2 and FLP may, or may not, be 
transcribed from divergent promoters. 



20 



25 



Typically, the regions defined between the inverted repeats (e.g. such as defined 
as UL and US in the 2um plasmid) of a 2um-family plasmid may, or may not, 
comprise not more than two endogenous genes that encode a protein that functions 
in the stable maintenance of the 2um-family plasmid as a multicopy plasmid. 
Thus in a preferred embodiment, one region of the plasmid defined between the 
inverted repeats may, or may not, comprise not more than the ORFs encoding FLP 
and REP2; FLP and REP J; or REP J and REP2, as endogenous coding sequence. 



A 2um-family plasmid typically comprises an origin of replication (also known as 
an "autonomously replicating sequence - "ARS"), which is typically bidirectional. 
30 Any appropriate ARS sequence can be present. Consensus sequences typical of 
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yeast chromosomal origins of replication may, or may not be appropriate (Broach 
et al, 1982 ; Cold Spring Harbor Symp. Quant. Biol.., 47, 1165-1174: Williamson, 
Yeast, 1985, 1, 1-14). Preferred ARSs include those isolated from pSRL pSBl, 
pSB2, pSB3, pSB4 ; pSMl, pKDl, pPMl and the 2um plasmid. 

Thus, a preferred 2um-family plasmid may, or may not, comprise ORFs encoding 
FLP, REP1 and REP '2, two inverted repeat sequences each inverted repeat 
comprising an FRT site compatible with the encoded FLP protein, and an ARS 
sequence. Preferably the FRT sites are derived from the same 2um-family 
plasmid as the sequence of the encoded FLP protein. More preferably the 
sequences of the encoded REP1 and REP2 proteins are derived from the same 
2^m-family plasmid as each other. Even more preferably, the FRT sites are 
derived from the same 2um-family plasmid as the sequence of the encoded FLP , 
REP1 and REP2 proteins. Yet more preferably, the sequences of the ORFs 
encoding FLP, REP1 and REP2, and the sequence of the inverted repeats 
(including the FRT sites) are derived from the same 2um-family plasmid. 
Furthermore, the ARS site may, or may not, be derived from the same 2nm-family 
plasmid as one or more of the ORFs of FLP, REP1 and REP 2, and the sequence of 
the inverted repeats (including the FRT sites). 

The term "derived from" includes sequences having an identical sequence to the 
sequence from which they are derived. However, variants and fragments thereof, 
as defined above, are also included. For example, an FLP gene having a sequence 
derived from the FLP gene of the 2um plasmid may, or may not, have a modified 
promoter or other regulatory sequence compared to that of the naturally occurring 
gene. Additionally or alternatively, an FLP gene having a sequence derived from 
the FLP gene of the 2um plasmid may, or may not, have a modified nucleotide 
sequence in the open reading frame which may, or may not, encode the same 
protein as the naturally occurring gene, or may, or may not, encode a modified 
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FLP protein. The same considerations apply to other sequences on a 2um-family 
plasmid having a sequence derived from a particular source. 

Optionally, a 2um-family plasmid may, or may not, comprise a region derived 
5 from the STB region (also known as REPS) of the 2um plasmid, as defined in 
Volkert et al, op. cit. The STB region in a 2um-family plasmid of the invention 
may, or may not, comprise two or more tandem repeat sequences, such as three, 
four, five or more. Alternatively, no tandem repeat sequences may be present. 
The tandem repeats may be any size, such as 10, 20, 30, 40, 50, 60 70, 80, 90, 1 00 
10 bp or more in length. The tandem repeats in the STB region of the 2\xxn. plasmid 
are 62 bp in length. It is not essential for the sequences of the tandem repeats to 
be identical. Slight sequence variation can be tolerated. It may, or may not, be 
preferable to select an STB region from the same plasmid as either or both of the 
REP1 and REP2 ORFs. The STB region is thought to be a cw-acting element and 
15 preferably is not transcribed. 

Optionally, a 2um-family plasmid may, or may not, comprise an additional ORF 
that encodes a protein that functions in the stable maintenance of the 2um-family 
plasmid as a multicopy plasmid. The additional protein can be designated RAF or 

20 D. ORFs encoding the RAF or D gene can be seen on, for example, the 2um 
plasmid and pSMl. Thus a RAF or D ORF can comprise a sequence suitable to 
encode the protein product of the RAF or D gene ORFs encoded by the 2 urn 
plasmid or pSMl, or variants and fragments thereof. Thus variants and fragments 
of the protein products of the RAF or D genes of the 2um plasmid or pSMl are 

25 also included in the present invention. "Fragments" and "variants" of the protein 
products of the RAF or D genes of the 2um plasmid or pSMl are those which, 
when encoded by the 2 urn plasmid or pSMl in place of the native ORF, do not 
disrupt the stable multicopy maintenance of the plasmid within a suitable yeast 
population. Such variants and fragments will usually have at least 5%, 1 0%, 20%, 

30 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, or more, homology with 
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the protein product of the RAF or D gene ORFs encoded by the 2pm plasmid or 
pSMl. 

A naturally occurring 2pm-family plasmid may, or may not, be preferred. A 
naturally occurring 2pm-family plasmid is any plasmid having the features 
defined above, which plasmid is found to naturally exist in yeast, i.e. has not been 
recombinantly modified to include heterologous sequence. Optionally the 
naturally occurring 2pm-family plasmid is selected from pSRl (Accession No. 
X02398), pSB3 (Accession No. X02608) or pSB4 as obtained from 
Zygosaccharomyces rouxii, pSBl or .pSB2 (Accession No. NC_002055 or 
Ml 8274) both as obtained from Zygosaccharomyces bailli, pSMl (Accession No. 
NC_002054) as obtained from Zygosaccharomyces fermentati, pKDl (Accession 
No. X03961) as obtained from Kluyveromyces drosophilarum, pPMl from Pichia 
membranaefaciens or, preferably, the 2pm plasmid (Accession No. NC_001398 or 
JO 1347) as obtained from Saccharomyces cerevisiae. Accession numbers in this 
paragraph refer to NCBI deposits. 

The 2pm plasmid (Figure 1) is a 6,318-bp double-stranded DNA plasmid, 
endogenous in most Saccharomyces cerevisiae strains at 60-100 copies per 
haploid genome. The 2pm plasmid comprises a small unique (US) region and a 
large unique (UL) region, separated by two 599-bp inverted repeat sequences. 
Site-specific recombination of the inverted repeat sequences results in inter- 
conversion between the A-form and B-form of the plasmid in vivo (Volkert & 
Broach, 1986, Cell, 46, 541). The two forms of 2pm differ only in the relative 
orientation of their unique regions. 

' While DNA sequencing of a cloned 2 pm plasmid (also known as Scpl) from 
Saccharomyces cerevisiae gave a size of 6,318-bp (Hartley and Donelson, 1980, 
Nature, 286, 860), other slightly smaller variants of 2pm, Scp2 and Scp3, are 
known to exist as a result of small deletions of 125-bp and 220-bp, respectively, in 
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a region known as STB (Cameron et al, 1977, Nucl. Acids Res.., 4, 1429: Kikuchi, 
1983, Cell, 35, 487 and Livingston & Hahne, 1979, Proc. Natl. Acad. Sci. USA, 
76, 3727). In one study about 80% of natural Saccharomyces strains from around 
the world contained DNA homologous to 2um (by Southern blot analysis) 
5 (Hollenberg, 1982, Current Topics in Microbiology and Immunobiology, 96, 119). 
Furthermore, variation (genetic polymorphism) occurs within the natural 
population of 2um plasmids found in S. cerevisiae and S. carlsbergensis, with the 
NCBI sequence (accession number NC_00 13 98) being one example. 

10 The 2um plasmid has a nuclear localisation and displays a high level of mitotic 
stability (Mead et al, 1986, Molecular & General Genetics, 205, 417). The 
inherent stability of the 2um plasmid results from a plasmid-encoded copy 
number amplification and partitioning mechanism, which can be compromised 
during the development of chimeric vectors (Futcher & Cox, 1984, J. Bacterial., 

15 157, 283; Bachmair & Ruis, 1984, Monatshefte fur Chemie, 115, 1229). A yeast 
strain, which contains a 2 (am plasmid is known as [cir + ], while a yeast strain 
which does not contain a 2um plasmid is known as [cir 0 ]. 

The US-region of the 2um plasmid contains the REP2 and FLP genes, and the 
20 UL-region contains the REP1 and D (also known as RAF) genes, the STB-locus 
and the origin of replication (Broach & Hicks, 1980, Cell, 21, 501; Sutton & 
Broach, 1985, Mol. Cell. Biol, 5, 2770). The Flp recombinase binds to FRT-sites 
(Flp Recognition Target) within the inverted repeats to mediate site-specific 
recombination, which is essential for natural plasmid amplification and control of 
25 plasmid copy number in vivo (Senecoff et al, 1985, Proc. Natl. Acad. Sci. U.S.A., 
82, 7270; Jayaram, 1985, Proc. Natl. Acad. Sci. U.S.A., 82, 5875). The copy 
number of 2um-family plasmids can be significantly affected . by changes in Flp 
recombinase activity (Sleep et al, 2001, Yeast, 18, 403; Rose & Broach, 1990, 
Methods Enzymol, 185, 234). The Repl and Rep2 proteins mediate plasmid 
30 segregation, although their mode of action is unclear (Sengupta et al, 2001, J. 
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Bacteriol, 183, 2306). They also repress transcription of the FLP gene (Reynolds 
etal, 1987, Mol. Cell. Biol, 7, 3566). 

The FLP and REP2 genes of the 2u.m plasmid are transcribed from divergent 
promoters, with apparently no intervening sequence defined between them. The 
FLP and REP2 transcripts both terminate at the same sequence motifs within the 
inverted repeat sequences, at 24-bp and 178-bp respectively after their translation 
termination codons (Sutton & Broach, 1 985, Mol. Cell. Biol, 5, 2770). 

In the case of FLP, the C-terminal coding sequence also lies within the inverted 
repeat sequence. Furthermore, the two inverted repeat sequences are highly 
conserved over 599-bp, a feature considered advantageous to efficient plasmid 
replication and amplification in vivo, although only the FRT-sites (less than 65- 
bp) are essential for site-specific recombination in vitro (Senecoff et al, 1985, 
Proc. Natl Acad. Sci. U.S.A., 82, 7270; Jayaram, 1985, Proc. Natl Acad. Sci. 
U.S.A., 82, 5875; Meyer-Leon et al, 1984, Cold Spring Harbor Symposia On 
Quantitative Biology, 49, 797). The key catalytic residues of Flp are arginine-308 
and tyrosine-343 (which is essential) with strand-cutting facilitated by histidine- 
309 and histidine 345 (Prasad et al, 1987, Proc. Natl Acad. Sci. U.S.A., 84, 2189; 
Chen et al, 1992, Cell, 69, 647; Grainge et al, 2001, J. Mol. Biol, 314, 717). 

Two functional domains are described in Rep2. Residues 15-58 form a Repl- 
binding domain, and residues 59-296 contain a self-association and STB-binding 
region (Sengupta etal, 2001, J. Bacteriol, 183, 2306). 

Chimeric or large deletion mutant derivatives of 2um which lack many of the 
essential functional regions of the 2um plasmid but retain the functional cis 
element ARS and STB, cannot effectively partition between mother and daughter 
cells at cell division. Such plasmids can do so if these functions are supplied in 
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trans, by for instance the provision of a functional 2pm plasmid within the host, 
such as a [cir + ] host. 

Genes of interest have previously been inserted into the UL-region of the 2 urn 
plasmid. For example, see plasmid pSAC3Ul in EP 0 286 424 and the plasmid 
shown in Figure 2 of WO 2005/061718, which includes a p-lactamase gene (for 
ampicillin resistance), a LEU2 selectable marker and an oligonucleotide linker, the 
latter two of which are inserted into a unique SwaBI-site within the UL-region of 
the 2um-like disintegration vector, pSAC3 (see EP 0 286 424). The E, coli DNA 
between the A7>al-sites that contains the ampicillin resistance gene is lost from the 
plasmid shown in Figure 2 of WO 2005/061718 after transformation into yeast. 
This is described in Chinery & Hinchliffe, 1989, Curr. Genet., 16, 21 and EP 0 
286 424, where these types of vectors are designated "disintegration vectors". 
Further polynucleotide insertions can be made in a Notl-site within a linker (Sleep 
et al, 1991, Biotechnology (N Y), 9, 183). 

Alternative insertion sites in 2pm plasmid are known in the art, including those 
described in Rose & Broach (1990, Methods Enzymol, 185, 234-279), such as 
plasmids pCV19, pCV20, CV neo , which utilise an insertion at EcoRl in FLP, 
plasmids pCV21, pGT41 and pYE which utilise EcoKL in D as the insertion site, 
plasmid pHKB52 which utilises Pstl in D as the insertion site, plasmid pJDB248 
which utilises an insertion at Pstl in D and EcoRl in D, plasmid p.TDB219 in 
which Pstl in D and EcoRl in FLP are used as insertion sites, plasmid Gl 8, 
plasmid pAB18 which utilises an insertion at CM in FLP, plasmids pGT39 and 
pA3, plasmids pYTll, pYT14 and pYTll-LEU which use Pstl in D as the 
insertion site, and plasmid PTY39 which uses EcoW in FLP as the insertion site. 
Other 2pm plasmids include pSAC3, pSAC3Ul, pSAC3U2, pSAC300, pSAC310, 
P SAC3C1, pSAC3PLl, pSAC3SL4, and pSAC3SCl are described in EP 0 286 
424 and Chinery & Hinchliffe (1989, Curr. Genet., 16, 21-25) which also 
described Pstl, Eagl or SnaBl as appropriate 2pm insertion sites. Further 2pm 
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plasmids include pAYE255, pAYE316, pAYE443, pAYE522 (Kerry-Williams et 
al, 1998, Yeast, 14, 161-169), pDB2244 (WO 00/44772), and pAYE329 (Sleep et 
al 2001, Yeast, 18, 403-421). 

In one preferred embodiment, one or more genes are inserted into a 2um-family 
plasmid within an untranscribed region around the ARS sequence. For example, 
in the 2um plasmid obtained from S. cerevisiae, the untranscribed region around 
the ARS sequence extends from the end of the D gene to the beginning of ARS 
sequence. Insertion into SnaBl (near the origin of replication sequence ARS) is 
described in Chinery & Hinchliffe, 1989, Curr. Genet., 16, 21-25. The skilled 
person will appreciate that gene insertions can also be made in the untranscribed 
region at neighbouring positions to the SnaBl site described in Chinery & 
Hinchliffe. 

In another preferred embodiment, REP2 and FLP genes in a 2um-family plasmid 
each have an inverted repeat adjacent to them, and one or more genes are inserted 
into a 2um-family plasmid within the region between the first base after the last 
functional codon of either the REP 2 gene or the FLP gene and the last base before 
the FRT site in the inverted repeat adjacent to said gene. The last functional 
codon of either a REP 2 gene or a FLP gene is the codon in the open reading frame 
of the gene that is furthest downstream from the promoter of the gene whose 
replacement by a stop codon will lead to an unacceptable loss of multicopy 
stability of the plasmid, as defined herein. Thus, disruption of the REP2 or FLP 
genes at any point downstream of the last functional codon in either gene, by 
insertion of a polynucleotide sequence insertion, deletion or substitution will not 
lead to an unacceptable loss of multicopy stability of the plasmid. 

For example, the REP2 gene of the 2um plasmid can be disrupted after codon 59 
and that the FLP gene of the 2um plasmid can be disrupted after codon 344, each 
without a loss of multicopy stability of the plasmid. The last functional codon in 
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equivalent genes in other 2um-family piasmids can be determined routinely by 
making mutants of the piasmids in either the FLP or REP2 genes and following 
the tests set out herein to determine whether the plasmid retains multicopy 
stability. Thus, a plasmid insertion site as defined in WO 2005/061719 may, or 
may not, be used to carry one or more a recombinant genes according to any 
aspect of the present invention. 

One can determine whether a plasmid retains multicopy stability using test such as 
defined in Chinery & Hinchliffe (1989, Curr. Genet, 16, 21-25). For yeast that 
do not grow in the non-selective media (YPD, also designated YEPD) defined in 
Chinery & Hinchliffe (1989, Curr. Genet, 16, 21-25) other appropriate non- 
selective media might be used. Plasmid stability may be defined as the percentage 
cells remaining prototrophic for the selectable marker after a defined number of 
generations. The number of generations will preferably be sufficient to show a 
difference between a control plasmid, such as pSAC35 or pSAC310, or to shown 
comparable stability to such a control plasmid. The number of generations may 
be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 35, 40, 
45, 50, 60, 70, 80, 90, 100 or more. Higher numbers are preferred. The 
acceptable plasmid stability might be 1%, 2%, 3%, 4%, 5%, 10%, 15%, 20%, 
25%, 30%, 40%, 50%, 60%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 
99%, 99.9% or substantially 100%. Higher percentages are preferred. The skilled 
person will appreciate that, even though a plasmid may have a stability less than 
100% when grown on non-selective media, that plasmid can still be of use when 
cultured in selective media. For example plasmid pDB2711 as described in the 
examples is only 10% stable when the stability is determined accordingly to test of 
Example 2 of WO 2005/061719, but provides a 15-fold increase in recombinant 
transferrin productivity in shake flask culture under selective growth conditions. 

Thus one or more gene insertions may, or may not, occur between the first base 
after the last functional codon of the REP2 gene and the last base before the FRT 
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site in an inverted repeat adjacent to said gene, preferably between the first base of 
the inverted repeat and the last base before the FRT site, more preferably at a 
position after the translation termination codon of the REP2 gene and before the 
last base before the FRT site. 

Additionally or alternatively one or more gene insertions may, or may not, occur 
between the first base after the last functional codon of the FLP gene and the last 
base before the FRT site in an inverted repeat adjacent to said gene, preferably 
between the first base of the inverted repeat and the last base before the FRT site, 
more preferably between the first base after the end of the FLP coding sequence 
and the last base before the FRT site, such as at the first base after the end of the 
FLP coding sequence. 

In one preferred embodiment, where the 2um-family plasmid is based on the 2um 
plasmid of S. cerevisiae, it is a disintegration vector as known in the art (for 
example, see EP 286 424, the contents of which are incorporated herein by 
reference). A disintegration vector may, or may not, be a 2um plasmid vector 
comprising a DNA sequence which is intended to be lost by recombination, three 
2um FRT sites, of which one pair of sites is in direct orientation and the other two. 
pairs are in indirect orientation, and a DNA sequence of interest (such as an E. coli 
origin of replication and bacterial selectable marker), the said sequence to be lost 
being located between the said sites which are in direct orientation. 

Thus, the sequence to be lost may, or may not, comprise a selectable marker DNA 
sequence. 

A preferred disintegration vector may, or may not, comprise a complete 2um 
plasmid additionally carrying (i) a bacterial plasmid DNA sequence necessary for 
propagation of the vector in a bacterial host; (ii) an extra 2um FRT site; and a 
selectable marker DNA sequence for yeast transformation; the said bacterial 
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plasmid DNA sequence being present and the extra FRT site being created at a 
restriction site, such as Xbal, in one of the two inverted repeat sequences of the 
2\un plasmid, the said extra FRT site being in direct orientation in relation to the 
endogenous FRT site of the said one repeat sequence, and the bacterial plasmid 
DNA sequence being sandwiched between the extra FRT site and the endogenous 
FRT site of the said one repeat sequence. In a preferred disintegration vector, all 
bacterial plasmid DNA sequences may, or may not, be sandwiched as said. A 
particularly preferred 2^im plasmid vector has substantially the configuration of 
pSAC3 as shown in EP 286 424. 

The term "disintegration vector" as used herein also includes plasmids as defined 
in US 6,451,559, the contents of which are incorporated herein by reference. Thus 
a disintegration vector may, or may not, be a 2 jam vector that, other than DNA 
sequence encoding non-yeast polypeptides, contains no bacterial (particularly E. 
coli) origin of replication, or more preferably no bacterial (particularly E. coli) 
sequence and preferably all DNA in said vector, other than DNA sequence 
encoding non-yeast polypeptides, is yeast-derived DNA. 

Desired yroteins and other proteins defined b v the present application: 

The terms "protein" and "desired protein" as used herein includes all natural and 
non-natural proteins, polypeptides and peptides. For the purposes of the present 
invention, a "heterologous protein" is a protein that is encoded by a "recombinant 
gene" as described above. The "heterologous protein" may, or may not, be 
identical in sequence to a protein that is encoded by one of more other genes that 
naturally occur in the expression system that is used (by "expression system" we 
include the meaning of a host cell's genome (typically the chromosome) where the 
"recombinant gene" is chromosomally integrated, or a plasmid where the 
"recombinant gene" is encoded by a plasmid). For example, in the context of a 
"heterologous protein" that is encoded by a "recombinant gene" carried on a 2|am- 
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family plasmid, the "heterologous protein" may, or may not, be a protein that is 
not naturally encoded by a 2um-family plasmid and can also be described as a 
"non 2um-family plasmid protein". For convenience, the terms "heterologous 
protein" and "non 2um-family plasmid protein" are used synonymously in this 
application. Optionally therefore, when encoded by a 2um-family, the 
heterologous protein is not a FLP, REP1, REP 2, or a RAF/D protein as encoded 
by any one of pSRl, pSB3 or pSB4 as obtained from Z rouxii, pSBl or pSB2 
both as obtained from Z. bailli, pSMl as obtained from Z. fermentati, pKDl as 
obtained from K. drosophilarum, pPMl as obtained from P. membranaefaciens or 
the 2um plasmid as obtained from S. cerevisiae. 

A gene encoding a desired heterologous, or other, protein comprises a 
polynucleotide sequence encoding the heterologous protein (typically according to 
standard codon usage for any given organism), designated the open reading frame 
("ORF"). The gene may, or may not, additionally comprise some polynucleotide 
sequence that does not encode an open reading frame (termed "non-coding 
region"). 

Non-coding region in the gene may, or may not, contain one or more regulatory 
sequences, operatively linked to the ORF, which allow for the transcription of the 
open reading frame and/or translation of the resultant transcript. 

The term "regulatory sequence" refers to a sequence that modulates (i.e., promotes 
or reduces) the expression (i.e., the transcription and/or translation) of an ORF to 
which it is operably linked. Regulatory regions typically include promoters, 
terminators, ribosome binding sites and the like. The skilled person will 
appreciate that the choice of regulatory region will depend upon the intended 
expression system. For example, promoters may, or may not, be constitutive or 
inducible and may, or may not, be cell- or tissue-type specific or non-specific. 
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Suitable regulatory regions, may, or may not., be 5bp, lObp, 15bp, 20bp, 25bp, 
30bp, 35bp, 40bp, 45bp, 50bp, 60bp, 70bp, 80bp, 90bp, lOObp, 120bp, 140bp, 
160bp, 180bp, 200bp, 220bp, 240bp, 260bp : 280bp, 300bp, 350bp, 400bp, 450bp, 
500bp, 550bp, 600bp, 650bp, 700bp, 750bp, 800bp ; 850bp, 900bp, 950bp, 
lOOObp, llOObp, 1200bp, 1300bp, 1400bp, 1500bp or greater, in length. 

Those skilled in the art will recognise that the gene encoding a chaperone, for 
example PDI, may, or may not, additionally comprise non-coding regions and/or 
regulatory regions. Such non-coding regions and regulatory regions are not 
restricted to the native non-coding regions and/or regulatory regions normally 
associated with the chaperone ORF. 

Where the expression system is yeast, such as Saccharomyces cerevisiae, suitable 
promoters for 5". cerevisiae include those associated with the PGK1 gene, GAL1 or 
GALJO genes, TEF1, TEF2, PYK1, PMA1, CYC1, PH05, TRP1, ADH1, ADH2, 
the genes for glyceraldehyde-3 -phosphate dehydrogenase, hexokinase, pyruvate 
decarboxylase, phosphofructokinase, triose phosphate isomerase, phospho glucose 
isomerase, glucokinase, ot-mating factor pheromone, a-mating factor pheromone, 
the PRB1 promoter, the PRA1 promoter, the GPD1 promoter, and hybrid 
promoters involving hybrids of parts of 5' regulatory regions with parts of 5' 
regulatory regions of other promoters or with upstream activation sites (e.g. the 
promoter ofEP-A-258 067). 

Suitable transcription termination signals are well known in the art. Where the 
host cell is eucaryotic, the transcription termination signal is optionally derived 
from the 3* flanking sequence of a eucaryotic gene, which contains proper signals 
for transcription termination and polyadenylation. Suitable 3' flanking sequences 
may, or may not, for example, be those of the gene naturally linked to the 
expression control sequence used, i.e. may, or may not, correspond to the 
promoter. Alternatively, they may be different. In that case, and where the host is 
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a yeast, optionally S. cerevisiae, then the termination signal of the S. cerevisiae 
ADH1, ADH2, CYC], or PGK1 genes are preferred. 

It may, or may not, be beneficial for the promoter and open reading frame of the 
5 gene, such as a gene encoding the chaperone (e.g. PD11) or a desired protein (such 
as a heterologous desired protein), to be flanked by transcription termination 
sequences so that the transcription termination sequences are located both 
upstream and downstream of the promoter and open reading frame, in order to 
prevent transcriptional read-through into neighbouring genes, such as 2um genes, 
10 • and vice versa. 

In one embodiment, the favoured regulatory sequences in yeast, such as 
Saccharomyces cerevisiae, include: a yeast promoter (e.g. the Saccharomyces 
cerevisiae PRB1 promoter), as taught in EP 431 880: and a transcription 
15 terminator, optionally the terminator from Saccharomyces ADH1, as taught in EP 
60 057. Optionally, the vector incorporates at least two translation stop codons. 

It may, or may not, be beneficial for the non-coding region to incorporate more 
than one DNA sequence encoding a translational stop codon, such as UAA, UAG 
20 or UGA, in order to minimise translational read-through and thus avoid the 
production of elongated, non-natural fusion proteins. The translation stop codon 
UAA is preferred. 

The term "operably linked" includes within its meaning that a regulatory sequence 
25 is positioned within any non-coding region in a gene such that it forms a 
relationship with an ORF that permits the regulatory region to exert an effect on 
the ORE in its intended manner. Thus a regulatory region "operably linked" to an 
ORE is positioned in such a way that the regulator}' region is able to influence 
transcription and/or translation of the ORE in the intended manner, under 
30 conditions compatible with the regulatory sequence. 
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In one preferred embodiment, the desired protein (such as the heterologous 
desired protein) is secreted. In that case, a sequence encoding a secretion leader 
sequence which, for example, comprises most of the natural HSA secretion leader, 
plus a small portion of the S. cerevisiae a-mating factor secretion leader as taught 
in WO 90/01 063 may, or may not, be included in the open reading frame. 

Alternatively, the desired protein (such as a heterologous desired protein) may, or 
may not, be intracellular. 

The desired protein (such as a heterologous desired protein) may, or may not, 
comprise the sequence of a eucaryotic protein, or a fragment or variant thereof. 
Suitable eucaryotes include fungi, plants and animals. In one embodiment the 
heterologous protein may, or may not, be a fungal protein, such as a yeast protein. 
In another preferred embodiment the desired protein (such as a heterologous 
desired protein) may, or may not, be an animal protein. Exemplary animals 
include vertebrates and invertebrates. Exemplary vertebrates include mammals, 
such as humans, and non-human mammals. 

Thus the desired protein (such as a heterologous desired protein) may, or may not, 
comprise the sequence of a yeast protein. It may, or may not, for example, 
comprise the sequence of a yeast protein from the same host from which a 2^im- 
family plasmid is derived, particularly if the gene encoding the heterologous 
protein is integrated into said 2|^m-family plasmid. Those skilled in the art will 
recognise that a method, use or plasmid of the invention may, or may not, 
comprise DNA sequences encoding more than one heterologous protein, more 
than one chaperone, or more than one heterologous protein and more than one 
chaperone. 
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In another embodiment, the desired protein (such as a desired heterologous 
protein) may, or may not, comprise the sequence of albumin, a monoclonal 
antibody, an etoposide, a serum protein (such as a blood clotting factor), 
antistasin, a tick anticoagulant peptide, transferrin, lactoferrin, endostatin, 

5 angiostatin, coliagens, immunoglobulins or immunoglobulin-based molecules or 
fragment of either (e.g. a Small Modular ImmunoPharmaceutical™ ("SMIP") or 
dAb, Fab' fragments, F(ab')2, scAb, scFv or scFv fragment), a Kunitz domain 
protein (such as those described in WO 03/066824, with or without albumin 
fusions), interferons, interleukins, IL10, IL11, IL2, interferon a species and sub- 

10 species, interferon p species and sub-species, interferon y species and sub-species, 
leptin, CNTF, CNTFaxis, IL1 -receptor antagonist, erythropoietin (EPO) and EPO 
mimics, thrombopoietin (TPO) and TPO mimics, prosaptide, cyanovirin-N, 5- 
helix, T20 peptide, T1249 peptide, HIV gp41, HIV gpl20, urokinase, 
prourokinase, tPA, hirudin, platelet derived growth factor, parathyroid hormone, 

15 proinsulin, insulin, glucagon, glucagon-like peptides, insulin-like growth factor, 
calcitonin, growth hormone, transforming growth factor (3, tumour necrosis factor, 
G-CSF, GM-CSF, M-CSF, FGF, coagulation factors in both pre and active forms, 
including but not limited to plasminogen, fibrinogen, thrombin, pre-thrombin, pro- 
thrombin, von Willebrand's factor, a, -antitrypsin, plasminogen activators, Factor 

20 VII, Factor VIII, Factor IX, Factor X and Factor XIII, nerve growth factor, LACI, 
platelet-derived endothelial cell growth factor (PD-ECGF), glucose oxidase, 
serum cholinesterase, aprotinin, amyloid precursor protein, inter-alpha trypsin 
inhibitor, antithrombin III, apo-lipoprotein species, Protein C, Protein S, a 
metabolite, an antibiotic, or a variant or fragment of any of the above. 

25 

A "variant", in the context of the above-listed proteins, refers to a protein wherein at 
one or more positions there have been amino acid insertions, deletions, or 
substitutions, either conservative or non-conservative, provided that such changes 
result in a protein whose basic properties, for example enzymatic activity or receptor 
30 binding (type of and specific activity), thermostability, activity in a certain pH-range 
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(pH-stability) have not significantly been changed. "Significantly" in this context 
means that one skilled in the art would say that the properties of the variant may still 
be different but would not be unobvious over the ones of the original protein. 

By "conservative substitutions" is intended combinations such as Val, lie. Leu, Ala, 
Met; Asp, Glu: Asn, Gin; Ser 5 Thr, Gly, Ala; Lys 5 Arg, His; and Phe, Tyr, Trp. 
Preferred conservative substitutions include Gly, Ala; Val, lie. Leu; Asp, Glu; Asn, 
Gin; Ser, Thr; Lys, Arg: and Phe, Tyr. 

A "variant" typically has at least 25%, at least 50%, at least 60% or at least 70%, 
preferably at least 80%, more preferably at least 90%, even more preferably at least 
95%, yet more preferably at least 99%, most preferably at least 99.5% sequence 
identity to the polypeptide from which it is derived. 

The percent sequence identity between two polypeptides may be determined using 
suitable computer programs, for example the GAP program of the University of 
Wisconsin Genetic Computing Group and it will be appreciated that percent 
identity is calculated in relation to polypeptides whose sequence has been aligned 
optimally. 

The alignment may alternatively be carried out using the Clustal W program 
(Thompson et al, (1994) Nucleic Acids Res., 22(22), 4673-80). The parameters 
used may, or may not, be as follows: 

• Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 5, 
gap penalty; 3, number of top diagonals; 5. Scoring method: x percent. 

• Multiple alignment parameters: gap open penalty; 10, gap extension penalty; 
0.05. 

• Scoring matrix: BLOSUM. 
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Such variants may, or may not, be natural or made using the methods of protein 
engineering and site-directed mutagenesis as are well known in the art. 

A "fragment' \ in the context of the above-listed proteins, refers to a protein wherein 
5 at one or more positions there have been deletions. Thus the fragment may, or may 
not comprise at most 5, 10, 20, 30, 40 or 50% of the complete sequence of the full 
mature polypeptide. Typically a fragment comprises up to 60%, more typically up to 
70%, preferably up to 80%, more preferably up to 90%, even more preferably up to 
95%, yet more preferably up to 99% of the complete sequence of the full desired 
10 protein. Particularly preferred fragments of a protein comprise one or more whole 
domains of the protein. 

In one particularly preferred embodiment the desired protein (such as a desired 
heterologous protein) comprises the sequence of albumin or a variant or fragment 
15 thereof. 

By "albumin" we include a protein comprising the sequence of an albumin protein 
obtained from any source. Typically the source is mammalian. In one preferred 
embodiment the serum albumin is human serum albumin ("HSA"). The term 

20 "human serum albumin" includes the meaning of a serum albumin having an 
amino acid sequence naturally occurring in humans, and variants thereof. 
Optionally the albumin has the amino acid sequence disclosed in WO 90/13653 or 
a variant thereof. The HSA coding sequence is obtainable by known methods for 
isolating cDNA corresponding to human genes, and is also disclosed in, for 

25 example, EP 73 646 and EP 286 424. 

In another preferred embodiment the "albumin" comprises the sequence of bovine 

serum albumin. The term "bovine serum albumin" includes the meaning of a 

serum albumin having an amino acid sequence naturally occurring in cows, for 

30 example as taken from Swissprot accession number P02769, and variants thereof 

as defined below. The term "bovine serum albumin" also includes the meaning of 
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fragments of full-length bovine serum albumin or variants thereof, as defined 
below. 

In another preferred embodiment the albumin comprises the sequence of an 

5 albumin derived from one of serum albumin from dog (e.g. see Swissprot 
accession number P49822), pig (e.g. see Swissprot accession number P08835), 
goat (e.g. as available from Sigma as product no. A2514 or A4164), turkey (e.g. 
see Swissprot accession number 073860), baboon (e.g. as available from Sigma 
as product no. A1516), cat (e.g. see Swissprot accession number P49064), chicken 

10 (e.g. see Swissprot accession number P19121), ovalbumin (e.g. chicken 
ovalbumin) (e.g. see Swissprot accession number P01012), donkey (e.g. see 
Swissprot accession number P39090), guinea pig (e.g. as available from Sigma as 
product no. A3060, A2639, 05483 or A6539), hamster (e.g. as available from 
Sigma as product no. A5409), horse (e.g. see Swissprot accession number 

15 P35747), rhesus monkey (e.g. see Swissprot accession number Q28522), mouse 
(e.g. see Swissprot accession number 089020), pigeon (e.g. as defined by Khan et 
al 9 2002, Int. J. Biol MacromoL, 30(3-4),171-8), rabbit (e.g. see Swissprot 
accession number P49065), rat (e.g. see Swissprot accession number P36953) and 
sheep (e.g. see Swissprot accession number PI 4639) and includes variants and 

20 fragments thereof as defined below. 

Many naturally occurring mutant forms of albumin are known. Many are 
described in Peters, (1996, All About Albumin: Biochemistiy, Genetics and Medical 
Applications, Academic Press, Inc., San Diego, California, p. 170-1 81). A variant as 
25 defined above may, or may not, be one of these naturally occurring mutants. 

A "variant albumin" refers to an albumin protein wherein at one or more positions 

there have been amino acid insertions, deletions, or substitutions, either conservative 

or non-conservative, provided that such changes result in an albumin protein for 

30 which at least one basic property, for example binding activity (type of and specific 

activity e.g. binding to bilirubin), osmolality (oncotic pressure, colloid osmotic 
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pressure), behaviour in a certain pH-range (pH- stability) has not significantly been 
changed. "Significantly 5 ' in this context means that one skilled in the art would say 
that the properties of the variant may still be different but would not be unobvious 
over the ones of the original protein. 

5 

By "conservative substitutions" is intended combinations such as Gly, Ala; Val ? He. 
Leu: Asp, Glu; Asn, Gin; Ser, Thr; Lys, Arg; and Phe, Tyr. Such variants may, or 
may not, be made by techniques well known in the art, such as by site-directed 
mutagenesis as disclosed in US Patent No 4,302,386 issued 24 November 1981 to 
l o Stevens, incorporated herein by reference. 

Typically an albumin variant will have more than 40%, usually at least 50%, more 
typically at least 60%, preferably at least 70%, more preferably at least 80%, yet 
more preferably at least 90%, even more preferably at least 95%, most preferably at 

15 least 98% or more sequence identity with naturally occurring albumin. The percent 
sequence identity between two polypeptides may be determined using suitable 
computer programs, for example the GAP program of the University of Wisconsin 
Genetic Computing Group and it will be appreciated that percent identity is 
calculated in relation to polypeptides whose sequence has been aligned optimally. 

20 The alignment may alternatively be carried out using the Clustal W program 
(Thompson et ah, 1994). The parameters used may, or may not, be as follows: 

Fast pairwise alignment parameters: K-tuple(word) size; 1, window size; 5, gap 
penalty; 3, number of top diagonals; 5. Scoring method: x percent. Multiple 
25 alignment parameters: gap open penalty; 10, gap extension penalty; 0.05. Scoring 
matrix: BLOSUM. 

The term "fragment" as used above includes any fragment of full-length albumin 

or a variant thereof, so long as at least one basic property, for example binding 

30 activity (type of and specific activity e.g. binding to bilirubin), osmolarity (oncotic 

pressure, colloid osmotic pressure), behaviour in a certain pH-range (pH-stability) 
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has not significantly been changed. "Significantly" in this context means that one 
skilled in the art would say that the properties of the variant may still be different but 
would not be unobvious over the ones of the original protein. A fragment will 
typically be at least 50 amino acids long. A fragment may, or may not, comprise 
5 at least one whole sub-domain of albumin. Domains of HS A have been expressed 
as recombinant proteins (Dockal, M. et ah, 1999 ; ./. Biol Chem., 274, 29303- 
29310), where domain I was defined as consisting of amino acids 1-197, domain 
II was defined as consisting of amino acids 189-385 and domain III was defined 
as consisting of amino acids 381-585. Partial overlap of the domains occurs 

10 because of the extended a-helix structure (hlO-hl) which exists between domains 
I and IL and between domains II and III (Peters, 1996, op. cit., Table 2-4). HSA 
also comprises six sub-domains (sub-domains IA, IB, IIA, IIB, II1A and IIIB). 
Sub-domain IA comprises amino acids 6-105, sub-domain IB comprises amino 
acids 120-177, sub-domain IIA comprises amino acids 200-29 L sub-domain IIB 

15 comprises amino acids 316-369, sub-domain IIIA comprises amino acids 392-491 
and sub-domain IIIB comprises amino acids 512-583. A fragment may, or may 
not, comprise a whole or part of one or more domains or sub-domains as defined 
above, or any combination of those domains and/or sub-domains. 

20 In another particularly preferred embodiment the desired protein (such as a desired 
heterologous protein) comprises the sequence of transferrin or a variant or 
fragment thereof. The term "transferrin 55 as used herein includes all members of 
the transferrin family (Testa, Proteins of iron metabolism, CRC Press, 2002; 
Harris & Aisen, Iron carriers and iron proteins, VoL 5, Physical Bioinorganic 

25 Chemistry, VCH, 1991) and their derivatives, such as transferrin, mutant 
transferrins (Mason et al s 1993, Biochemistry, 32, 5472; Mason et al 9 1998, 
Biochem. J., 330(1), 35), truncated transferrins, transferrin lobes (Mason et al, 
1996, Protein Expr. Purif, 8, 119; Mason et al 9 1991, Protein Expr. Purif, 2, 
214), lactoferrin, mutant lactoferrins, truncated lactoferrins, lactoferrin lobes or 

30 fusions of any of the above to other peptides, polypeptides or proteins (Shin et, al 9 
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1995, Proc. Natl. Acad. Sci. USA, 92, 2820; Ali et al, 1999, J. Biol. Chem., 274, 
24066; Mason et al, 2002, Biochemistiy, 41, 9448). 

The transferrin may, or may not, be human transferrin. The term "human 
5 transferrin" is used herein to denote material which is indistinguishable from 
transferrin derived from a human or which is a variant or fragment thereof. A 
"variant" includes insertions, deletions and substitutions, either conservative or 
non-conservative, where such changes do not substantially alter the useful ligand- 
binding or immunogenic properties of transferrin. 

10 

Mutants of transferrin are included in the invention. Such mutants may, or may 
not, have altered immunogenicity. For example, transferrin mutants may, or may 
not, display modified (e.g. reduced) glycosylation. The N-linked glycosylation 
pattern of a transferrin molecule can be modified by adding/removing amino acid 
15 glycosylation consensus sequences such as N-X-S/T, at any or all of the N, X, or 
S/T position. Transferrin mutants may, or may not, be altered in their natural 
binding to metal ions and/or other proteins, such as transferrin receptor. An 
example of a transferrin mutant modified in this manner is exemplified below. 

20 We also include naturally-occurring polymorphic variants of human transferrin or 
human transferrin analogues. Generally, variants or fragments of human 
transferrin will have at least 5%, 10%, 15%, 20%, 30%, 40% or 50% (preferably 
at least 80%, 90% or 95%) of human transferrin's ligand binding activity (for 
example iron-binding), weight for weight. The iron binding activity of transferrin 

25 or a test sample can be determined spectrophotometrically by 470nm:280nm 
absorbance ratios for the proteins in their iron-free and fully iron-loaded states. 
Reagents should be iron-free unless stated otherwise. Iron can be removed from 
transferrin or the test sample by dialysis against 0.1M citrate, 0.1M acetate, lOmM 
EDTA pH4.5. Protein should be at approximately 20mg/mL in lOOmM HEPES, 

30 lOmM NaHC0 3 pH8.0. Measure the 470nm:280nm absorbance ratio of apo- 

transferrin (Calbiochem, CN 'Biosciences, Nottingham, UK) diluted in water so 
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10 



that absorbance at 280nm can be accurately determined spectrophotometries 
(0% iron binding). Prepare 20mM iron-nitrilotriacetate (FeNTA) solution by 
dissolving 191mg nitrotriacetic acid in 2mL 1M NaOH, then add 2mL 0.5M ferric 
chloride. Dilute to 50mL with deionised water. Fully load apo-transferrin with 
iron (100% iron binding) by adding a sufficient excess of freshly prepared 20mM 
FeNTA, then dialyse the holo-transferrin preparation completely against lOOmM 
HEPES, lOmM NaHC0 3 pHS.O to remove remaining FeNTA before measuring 
the absorbance ratio at 470nm:280nm. Repeat the procedure using test sample, 
which should initially be free from iron, and compare final ratios to the control. 

Additionally, single or multiple heterologous fusions comprising any of the above; 
or single or multiple heterologous fusions to albumin, transferrin or 
immunoglobins or a variant or fragment of any of these may, or may not, be used. 
Such fusions include albumin N-terminal fusions, albumin C-terminal fusions and 
15 co-N-terminal and C-terminal albumin fusions as exemplified by WO 01/79271, 
and transferrin N-terminal fusions, transferrin C-terminal fusions, and co-N- 
terminal and C-terminal transferrin fusions. 

Examples of transferrin fusions are given in US patent applications 
20 US2003/0221201 and US2003/0226155, Shin, et ah, 1995, Proc Natl Acad Sci 
USA, 92, 2820, Ali, et ah, 1999, J Biol Chem, 274, 24066, Mason, et aL, 2002, 
Biochemistiy, 41, 9448, the contents of which are incorporated herein by 
reference. 

25 The skilled person will also appreciate that the open reading frame of any other 
gene or variant, or part or either, can be utilised as an open reading frame for use 
with the present invention. For example, the open reading frame may, or may not, 
encode a protein comprising any sequence, be it a natural protein (including a 
zymogen), or a variant, or a fragment (which may, or may not, for example, be a 

30 domain) of a natural protein; or a totally synthetic protein; or a single or multiple 

fusion of different proteins (natural or synthetic). Such proteins can be taken, but 
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not exclusively, from the lists provided in WO 01/79258, WO 01/79271, WO 
01/79442, WO 01/79443, WO 01/79444 and WO 01/79480, or a variant or 
fragment thereof: the disclosures of which are incorporated herein by reference. 
Although these patent applications present the list of proteins in the context of 
fusion partners for albumin, the present invention is not so limited and, for the 
purposes of the present invention, any of the proteins listed therein may, or may 
not, be presented alone or as fusion partners for albumin, the Fc region of 
immunoglobulin, transferrin, lactoferrin or any other protein or fragment or 
variant of any of the above, as a desired polypeptide. 

The desired protein (such as a desired heterologous protein) may, or may not, be a 
therapeutically active protein. In other words, it may, or may not, have a 
recognised medical effect on individuals, such as humans. Many different types 
of therapeutically active protein are well known in the art. 

The desired protein (such as a desired heterologous protein) may, or may not, be a 
protein that is useful in diagnostic techniques. Many different types of 
diagnostically useful protein are well known in the art. 

The desired protein (such as a desired heterologous protein) may, or may not, be a 
protein that has no relationship to healthcare. It may, or may not, for example, be 
a protein that has a utility as an industrial, domestic or nutritional (e.g. as a 
foodstuff or additive) agent. Many different types of proteins having industrial, 
domestic and/or nutritional utilities are also well known in the art. 

The desired protein (such as a desired heterologous protein) may, or may not, 
comprise a leader sequence effective to cause secretion in a host cell, such as in a 
yeast cell. 

Numerous natural or artificial polypeptide signal sequences (also called secretion 

pre regions) have been used or developed for secreting proteins from host cells. 
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The signal sequence directs the nascent protein towards the machinery of the cell 
that exports proteins from the cell into the surrounding medium or, in some cases, 
into the periplasmic space. The signal sequence is usually, although not 
necessarily, located at the N-terminus of the primary translation product and is 
5 generally, although not necessarily, cleaved off the protein during the secretion 
process, to yield the £; mature ,; protein. 

In the case of some proteins the entity that is initially secreted, after the removal 
of the signal sequence, includes additional amino acids at its N-terminus called a 
10 "pro" sequence, the intermediate entity being called a "pro-protein". These pro 
sequences may, or may not, assist the final protein to fold and become functional, 
and are usually then cleaved off. In other instances, the pro region simply 
provides a cleavage site for an enzyme to cleave off the pre-pro region and is not 
known to have another function. 

15 

The pro sequence can be removed either during the secretion of the protein from 
the cell or after export from the cell into the surrounding medium or periplasmic 
space. 

20 Polypeptide sequences which direct the secretion of proteins, whether they 
resemble signal (i.e. pre) sequences or pre-pro secretion sequences, are referred to 
as leader sequences. The secretion of proteins is a dynamic process involving 
translation, translocation and post-translational processing, and one or more of 
these steps may not necessarily be completed before another is either initiated or 

25 completed. 

For production of proteins in eucaryotic species such as the yeasts Saccharomyces 
cerevisiae, Zygosaccharomyces species, Kluy\>eromyces lactis and Pichia pastor is, 
known leader sequences include those from the S. cerevisiae acid phosphatase 
30 protein (Pho5p) (see EP 366 400), the invertase protein (Suc2p) (see Smith et al 
(1985) Science, 229, 1219-1224) and heat-shock protein-150 (HsplSOp) (see WO 
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95/33833). Additionally, leader sequences from the S. cerevisiae mating factor 
alpha- 1 protein (MFcc-1) and from the human lysozyme and human serum 
albumin (HSA) protein have been used, the latter having been used especially, 
although not exclusively, for secreting human albumin. WO 90/01063 discloses a 

5 fusion of the MFot-1 and HSA leader sequences, which advantageously reduces 
the production of a contaminating fragment of human albumin relative to the use 
of the MFa-1 leader sequence. Modified leader sequences are also disclosed in 
WO 2004/009819 and in the examples of this application; the reader will 
appreciate that those leader sequences can be used with proteins other than 

10 transferrin. In addition, the natural transferrin leader sequence may, or may not, 
be used to direct secretion of transferrin and other heterologous proteins. 

Where a chaperone that is recombinantly expressed according to the present 
invention is protein disulphide isomerase, then optionally the desired protein (such 
15 as a desired heterologous protein) may, or may not, comprise disulphide bonds in 
its mature form. Any disulphide bonds may, or may not, be intramolecular and/or 
intermolecular. 

The desired protein (such as a desired heterologous protein) may, or may not, be a 
20 commercially useful protein, such as a therapeutically, diagnostically, industrially, 
domestically or nutritionally useful protein. Some proteins, such as 
heterologously expressed proteins, are intended to interact with the cell in which 
they are expressed in order to bring about a beneficial effect on the cell's 
activities. These proteins are not, in their own right, commercially useful. 
25 Commercially useful proteins are proteins that have a utility ex vivo of the cell in 
which they are expressed. Nevertheless, the skilled reader will appreciate that a 
commercially useful protein may, or may not, also have a biological effect on the 
host cell expressing it (such as a heterologous protein), but that that effect is not 
the main or sole reason for expressing the protein therein. 

30 
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Commercially useful proteins may include proteins that are useful as metabolites 
or antibiotics, and the like. 

In one embodiment it is preferred that the desired protein (such as a desired 
5 heterologous protein) is not P-lactamase. In another embodiment it is preferred 
that the desired protein (such as a desired heterologous protein) is not antistasin. 
However, the reader will appreciate that neither of these provisos exclude genes 
encoding either p-lactamase or antistasin from being present in a host cell or on a 
plasmid of the invention, merely that the gene encoding the desired protein (such 
10 as a desired heterologous protein) encodes a protein other than [^-lactamase and/or 
antistasin. 

Plasmids useful in the practice of the present invention can, unless specified 
otherwise, be any type of plasmid. For the purposes of the present invention, 
15 references to "plasmids' 5 may, or may not, also include a reference to other types of 
vectors. It may be appropriate to choose a suitable plasmid based on the host cell 
system in which it will be used. 

Many plasmids and other vectors are known for the transformation of various 
20 expression systems, including systems employing: bacteria (e.g. Bacillus subtilis or 
Escherichia coli) transformed with, for example, recombinant bacteriophage, 
plasmid or cosmid DNA expression vectors; yeasts (e.g. Saccharomyces cerevisiae 
or Pichia pastoris) transformed with, for example, yeast expression vectors; insect 
cell systems transformed with, for example, viral expression vectors (e.g. 
25 baculovirus); plant cell systems transfected with, for example viral or bacterial 
expression vectors; animal cell systems, either in cell culture, transgenic or as gene 
therapy, transfected with, for example, adenovirus expression vectors. 

Typical procaryotic vector plasmids are: pUC18, pUC19, pBR322 and pBR329 
30 available from Biorad Laboratories (Richmond, CA, USA); p7rc99A, pKK223-3, 
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pKK233-3 5 pDR540 and pRIT5 available from Pharmacia (Piscataway, NJ, USA); 
pBS vectors, Phagescript vectors, Bluescript vectors, pNH8A, pNH16A, pNH18A, 
pNH46A available from Stratagene Cloning Systems (La Jolla, CA 92037, USA). 

5 A typical mammalian cell vector plasmid is pSVL available from Pharmacia 
(Piscataway, NJ, USA). This vector uses the SV40 late promoter to drive expression 
of cloned genes, the highest level of expression being found in T antigen-producing 
cells, such as COS-1 cells. An example of an inducible mammalian expression 
vector is pMSG, also available from Pharmacia (Piscataway, NJ, USA). This vector 

10 uses the glucocorticoid-inducible promoter of the mouse mammary tumour virus 
long terminal repeat to drive expression of the cloned gene. 

Useful yeast plasmid vectors include the 2]uim-family plasmids (as described above), 
as well as pRS403-406 and pRS413-416 which are generally available from 
15 Stratagene Cloning Systems (La Jolla, CA 92037, USA). Plasmids pRS403, 
pRS404, pRS405 and pRS406 are Yeast Integrating plasmids (Yips) and incorporate 
. the yeast selectable markers HIS3 } TRF1, LEU2 and URA3. Plasmids pRS413-416 
are Yeast Centromere plasmids (YCps). Other Yips and YCps plasmids may also be 
used. 

20 

Plasmids for use in any aspect of the present invention can be prepared by modifying 
plasmids, such as 2|im-family plasmids, known in the art by inserting the required 
sequences (for example, one or more genes encoding chaperones and/or one or more 
genes encoding a heterologous protein) using techniques well known in the art such 
25 as are described in by Sambrook ei al, Molecular Cloning: A Laboratory Manual, 
2001, 3rd edition, the contents of which are incorporated herein by reference. For 
example, one such method involves ligation via cohesive ends. Compatible 
cohesive ends can be generated on a DNA fragment for insertion and plasmid by the 
action of suitable restriction enzymes. These ends will rapidly anneal through 
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complementary base pairing and remaining nicks can be closed by the action of 
DNA ligase. 

A further method uses synthetic double stranded oligonucleotide linkers and 
5 adaptors. DNA fragments with blunt ends are generated by bacteriophage T4 DNA 
polymerase or Kcoli DNA polymerase I which remove protruding 3' termini and fill 
in recessed 3' ends. Synthetic linkers and pieces of blunt-ended double-stranded 
DNA, which contain recognition sequences for defined restriction enzymes, can be 
ligated to blunt-ended DNA fragments by T4 DNA ligase. They are subsequently 
10 digested with appropriate restriction enzymes to create cohesive ends and ligated to 
an expression vector with compatible termini. Adaptors are also chemically 
synthesised DNA fragments which contain one blunt end used for ligation but which 
also possess one preformed cohesive end. Alternatively a DNA fragment or DNA 
fragments can be ligated together by the action of DNA ligase in the presence or 
15 absence of one or more synthetic double stranded oligonucleotides optionally 
containing cohesive ends. 

Synthetic linkers containing a variety of restriction endonuclease sites are 
commercially available from a number of sources including Sigma-Genosys Ltd, 
20 London Road, Pampisford, Cambridge, United Kingdom. 

Appropriate insertion sites in plasmids (such as 2|am-family plasmids) include, but 
are not limited to, those discussed above. 

25 Host cells 

The present invention also provides a host cell comprising recombinant genes 
and/or plasmid according to any aspect of the present invention. The host cell 
may be any type of cell. Many suitable host cell expression systems are known, 
30 including bacteria (for example E. coli and Bacillus subtilis), yeasts (for example 
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Saccharomyces cerevisiae, Pichia pastoris and Kluyveromyces lactis), filamentous 
fungi (for example Aspergillus), plant cells, whole plants, animal. cells and insect 
cells. Bacterial and yeast host cells may, or may not, be preferred. Bacterial host 
cells may be useful for cloning purposes. Yeast host cells may be useful for 
expression of genes present in the plasmid. 

In one embodiment the host cell may, or may not be a yeast cell, such as a 
member of the Saccharomyces, Kluyveromyces, Arxula, Yarrowia, Candida, 
Schizosaccharomyces, Debaryomyces, Xanthophyllomyces, Geotrichum, Ashbya, 
Hortaea, Schwanniomyces, Trichosporon, Xanthophyllomyces, or Pichia genus. 
Yeast such Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, 
Pichia membranaefaciens, Zygosaccharomyces rouxii, Zygosaccharomyces bailii, 
Zygosaccharomyces fermentati, Kluyveromyces drosphilarum, Pichia 
methanolica, Hansenula polymorpha (also known as Pichia augusta), Arxula 
adeninivorans, Yarrowia lipolytics Candida boidinii Candida utilis, 
Schizosaccharomyces pombe may, or may not, be preferred. Other suitable yeast 
may, or may not, include Debaryomyces hansenii, Xanthophyllomyces 
dendrorhous, Geotrichum candidum, Ashbya gossypii, Hortaea wemeckii, 
Schwanniomyces occidentalis, Trichosporon domesticum, and/or 
Xanthophyllomyces dendrorhous, 

It is may, or may not, be particularly advantageous to use a yeast deficient in one 
or more protein mannosyl transferases involved in O-glycosylation of proteins, for 
instance by disruption of the gene coding sequence, as discussed in WO 
2004/083245, the contents of which are incorporated herein by reference. 

In another embodiment the host cell may, or may not, be an animal cell. For 
example, the animal cell may, or may not, be a mammalian cell, such as a human 
cell type. 
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The host cell type may, or may not, be selected for compatibility with a plasmid 
type being used. Plasmids obtained from one yeast type can be maintained in 
other yeast types (lrie et al 9 1991, Gene, 108(1), 139-144; Irie et al 9 1991, Mol 
Gen. Genet., 225(2), 257-265). For example, pSRl from Zygosaccharomyces 
5 ronxii can be maintained in Saccharomyces cerevisiae. Optionally, the host cell is 
compatible with a 2^im-family plasmid (see above for a full description of the 
following plasmids). For example, where the plasmid is based on pSRl, pSB3 or 
pSB4 then a suitable yeast cell is Zygosaccharomyces rouxii; where the plasmid is 
based on pSBl or pSB2 then a suitable yeast cell is Zygosaccharomyces bailli; 

10 where the plasmid is based on pSMl then a suitable yeast cell is 
Zygosaccharomyces fermentati; where the plasmid is based on pKDl then a 
suitable yeast cell is Kluyveromyces drosophilarum; where the plasmid is based on 
pPMl then a suitable yeast cell is Pichia membranaefaciens; where the plasmid is 
based on the 2\xm plasmid then a suitable yeast cell is Saccharomyces cerevisiae 

15 or Saccharomyces carlsbergensis. It is particularly preferred that the plasmid is 
based on the 2^im plasmid and the yeast cell is Saccharomyces cerevisiae. 

A 2|am-family plasmid of the invention can be said to be "based on" a naturally 
occurring plasmid if it comprises one, two or preferably three of the genes FLP, 
20 REP J and REP 2 having sequences derived from that naturally occurring plasmid. 

It may, or may not, be particularly advantageous to use a yeast deficient in one or 
more protein mannosyl transferases involved in O-glycosylation of proteins, for 
instance by disruption of the gene coding sequence. 

25 

Recombinantly expressed proteins can be subject to undesirable post-translational 
modifications by the producing host cell. For example, the albumin protein 
sequence does not contain any sites for N-linked glycosylation and has not been 
reported to be modified, in nature, by Olinked glycosylation. However, it has. 
30 been found that recombinant human albumin ("rHA") produced in a number of 
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yeast species can be modified by O-linked glycosylation, generally involving 
mannose. The mannosylated albumin is able to bind to the lectin Concanavalin A. 
The amount of mannosylated albumin produced by the yeast can be reduced by 
using a yeast strain deficient in one or more of the PMT genes (WO 94/04687). 
5 The most convenient way of achieving this is to create a yeast which has a defect 
in its genome such that a reduced level of one of the Pmt proteins is produced. 
For example, there may, or may not, be a deletion, insertion or transposition in the 
coding sequence or the regulatory regions (or in another gene regulating the 
expression of one of the PMT genes) such that little or no Pmt protein is produced. 
10 Alternatively, the yeast could be transformed to produce an anti-Pmt agent, such 
as an anti-Pmt antibody. 

If a yeast other than S. cerevisiae is used, disruption of one or more of the genes 
equivalent to the PMT genes of S. cerevisiae is also beneficial, e.g. in Pichia 
15 pastoris or Kluyveromyces lactis. The sequence of PMT1 (or any other PMT 
gene) isolated from S. cerevisiae may, or may not, be used for the identification or 
disruption of genes encoding similar enzymatic activities in other fungal species. 
The cloning of the PMT1 homologue of Kluyveromyces lactis is described in WO 
94/04687. 

20 

The yeast may, or may not, have a deletion of the HSP150 and/or YAP3 genes as 
taught respectively in WO 95/33833 and WO 95/23857. 

A plasmid as defined above, may, or may not, be introduced into a host through 
25 standard techniques. With regard to transformation of procaryotic host cells, see, for 
example, Cohen et al (1972) Proc. Natl Acad. Set USA 69, 2110 and Sambrook et 
al (2001) Molecular Cloning, A Laboratory Manual, 3 rd Ed. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY. Transformation of yeast cells is described in 
Sherman et al (1986) Methods In Yeast Genetics, A Laboratory Manual, Cold Spring 
30 Harbor, NY. The method of Beggs (1978) Nature 275, 104-109 is also useful. 
Methods for the transformation of S. cerevisiae are taught generally in EP 251 



WO 2006/067511 



PCT/GB2005/005085 



744, EP 258 067 and WO 90/01063, all of which are incorporated herein by 
reference. With regard to vertebrate cells, reagents useful in transfecting such cells, 
for example calcium phosphate and DEAE-dextran or liposome formulations, are 
available from Stratagene Cloning Systems, or Life Technologies Inc., Gaithersburg, 
MD 20877, USA. 

Electroporation is also useful for transforming cells and is well known in the art for 
transforming yeast cell, bacterial cells and vertebrate cells. ' Methods for 
transformation of yeast by electroporation are disclosed in Becker & Guarente 
(1 990) Methods Enzymol 194, 1 82. 

Generally, where a plasmid is used, it will transform not all of the hosts and it will 
therefore be necessary to select for transformed host cells. Thus, a plasmid may, or 
may not, comprise a selectable marker, including but not limited to bacterial 
selectable marker and/or a yeast selectable marker. A typical bacterial selectable 
marker is the P -lactamase gene although many others are known in the art. Yeast 
selectable marker include LEU2, TRP1, HIS3, HIS4, URA3, URA5, SFA1, ADE2, 
METIS, LYS5, LYS2, ILV2, FBA1, PSEJ, PDI1 and PGKL Those skilled in the 
art will appreciate that any gene whose chromosomal deletion or inactivation 
results in an inviable host, so called "essential" genes, can be used as a selective 
marker if a functional gene is provided on the plasmid, as demonstrated for PGK1 
in apgkl yeast strain (Piper and Curran, 1990, Curr. Genet. 17, 119). Suitable 
"essential" genes can be found within the Stanford Genome Database (SGD), 
http:://db.yeastgenome.org). Any "essential" gene product (e.g. a product of one 
of the PDI1, PSE1, PGK1 or FBA1 genes, and others described elsewhere in this 
application) which, when deleted or inactivated, does not result in an auxotrophic 
(biosynthetic) requirement, can be used as a selectable marker on a plasmid in a 
host cell that, in the absence of the plasmid, is unable to produce that gene 
product, to achieve increased plasmid stability without the disadvantage of 
requiring the cell to be cultured under specific selective conditions. By 
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"auxotrophic (biosynthetic) requirement' 5 we include a deficiency which can be 
complemented by nutrient and other additions or modifications to the growth 
medium. Cells unable to express functional Pgklp or Fbalp can, however, be 
complemented by certain additions to growth media and such gene products may 

5 not be preferred "essential proteins" according to the present invention. 
Therefore, preferred "essential marker genes" in the context of the present 
invention are those that, when deleted or inactivated in a host cell, result in a 
deficiency which cannot be complemented by any additions or modifications to 
the growth medium, expect where those additions or modifications are, for 

10 example, a polynucleotide, that can restore the ability of the host cell to express 
the product of the "essential" marker gene, or the product of the "essential" 
marker gene itself. 

Accordingly, a plasmid as provided by, for use in a method of, or comprised in a 
15 host cell of, the present invention may, or may not, comprise more than one 
selectable marker. 

One selection technique involves incorporating into the expression vector a DNA 
sequence marker, with any necessary control elements, that codes for a selectable 
20 trait in the transformed cell. These markers include dihydrofolate reductase, G41 8 or 
neomycin resistance for eucaryotic cell culture, and tetracyclic kanamycin or 
ampicillin (i.e. p-lactamase) resistance genes for culturing in E.coli and other 
bacteria. Alternatively, the gene for such selectable trait can be on another vector, 
which is used to co-transform the desired host cell. 

25 

Another method of identifying successfully transformed cells involves growing the 
cells resulting from the introduction of the plasmid, optionally to allow the 
expression of a recombinant polypeptide (i.e. where a polypeptide which is encoded 
by a polynucleotide sequence on the plasmid and is not naturally produced by the 
30 host). Cells can be harvested and lysed and their DNA or RNA content examined for 

84 



WO 2006/067511 



PCT/GB2005/005085 



the presence of the recombinant sequence using a method such as that described by 
Southern (1975) J. Mol Biol 98, 503 or Berent el al (1985) Biotech 3, 208 or other 
methods of DNA and RNA analysis common in the art. Alternatively, the presence 
of a polypeptide in the supernatant of a culture of a transformed cell can be detected 
5 using antibodies. 

In addition to directly assaying for the presence of recombinant DNA. successful 
transformation can be confirmed by well known immunological methods when the 
recombinant DNA is capable of directing the expression of the protein. For example, 
10 cells successfully transformed with an expression vector produce proteins displaying 
appropriate antigenicity. Samples of cells suspected of being transformed are 
harvested and assayed for the protein using suitable antibodies. 

Thus, in addition to the transformed host cells themselves, the present invention also 
15 contemplates a culture of those cells, optionally a monoclonal (clonally 
homogeneous) culture, or a culture derived from a monoclonal culture, in a nutrient 
medium. Alternatively, transformed cells may, or may not, represent an 
industrially/commercially or pharmaceutically useful product and can be used 
without further purification or can be purified from a culture medium and optionally 
20 formulated with a carrier or diluent in a manner appropriate to their intended 
industrial/commercial or pharmaceutical use, and optionally packaged and presented 
in a manner suitable for that use. For example, whole cells could be immobilised; or 
used to spray a cell culture directly on to/into a process, crop or other desired target. 
Similarly, whole cell, such as yeast cells can be used as capsules for a huge variety of 
25 applications, such as fragrances, flavours and pharmaceuticals. 

Transformed host cells may, or may not, be cultured for a sufficient time and 
under appropriate conditions known to those skilled in the. art, and in view of the 
teachings disclosed herein, to permit the expression of one or more recombinant 
30 chaperones and a desired protein (such as a desired heterologous protein). 
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The culture medium may, or may not, be non-selective or may, or may not place a 
selective pressure on the maintenance of the plasmid. 

The thus produced desired protein (such as a desired heterologous protein) may, or 
5 may not, be present intracellular^ or, if secreted, in the culture medium and/or 
periplasmic space of the host cell. 

Protein Recovery and Formulation 

10 The step of purifying the thus expressed desired protein (such as a desired 
heterologous protein) from the cultured host cell or the culture medium optionally 
comprises cell immobilization, cell separation and/or cell breakage, but always 
comprises at least one other purification step different from the step or steps of 
cell immobilization, separation and/or breakage. 

15 

Cell immobilization techniques, such as encasing the cells using calcium alginate 
bead, are well known in the art. Similarly, cell separation techniques, such as 
centrifugation, filtration (e.g. cross-flow filtration, expanded bed chromatography 
and the like are well known in the art. Likewise, methods of cell breakage, 
20 including beadmilling, sonication, enzymatic exposure and the like are well 
known in the art. 

The at least one other purification step may be any other step suitable for protein 
purification known in the art. For example purification techniques for the 

25 recovery of recombinantly expressed albumin have been disclosed in: WO 
92/04367, removal of matrix-derived dye; EP 464 590, removal of yeast-derived 
colorants; EP 319 067, alkaline precipitation and subsequent application of the 
albumin to a lipophilic phase; and WO 96/37515, US 5 728 553 and WO 
00/44772, which describe complete purification processes; all of which are 

30 incorporated herein by reference. 
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Proteins other than albumin may be purified from the culture medium by any 
technique that has been found to be useful for purifying such proteins. 

Suitable methods include ammonium sulphate or ethanol precipitation, acid or 
5 solvent extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity chromatography, 
hydroxylapatife chromatography, lectin chromatography, concentration, dilution, pH 
adjustment, diafiltration, ultrafiltration, high performance liquid chromatography 
("HPLC"), reverse phase HPLC, conductivity adjustment and the like. 

10 

In one embodiment, any one or more of the above mentioned techniques may, or 
may not, be used to further purify the thus isolated protein to a commercially or 
industrially acceptable level of purity. By commercially or industrially acceptable 
level of purity, we include the provision of the protein at a concentration of at least 

15 0.01 g.L/ 1 , 0.02 g.U 1 , 0.03 g.U 1 , 0.04 g.U 1 , 0.05 g.L-\0.06 g.U 1 ,0.07 g.U 1 , 0.08 g.U 
\ 0.09 g.L" 1 , 0.1 g.U 1 , 0.2 g.L" 1 , 0.3 g.L" 1 , 0.4 g.L" 1 , 0.5 g.L" 1 , 0.6 g.L" 1 , 0.7 g.L" 1 , 0.8 
g.L-', 0.9 g.U 1 , 1 g.U\ 2 g.L" 1 , 3 g.L" 1 , 4 g.L" 1 , 5 g.L" 1 , 6 g.L" 1 , 7 g.U 1 , 8 g.L" 1 , 9 g.U 
', 10 g.U 1 , 15 g.U 1 , 20 g.U 1 , 25 g.U 1 , 30 g.U 1 , 40 g.U 1 ,50 g.U 1 , 60 g.U 1 , 70 g.U 1 , 
70 g.U 1 , 90 g.L-', 100 gU" 1 , 150 g.U 1 , 200 g.U 1 ,250 g.U 1 , 300 g.U 1 , 350 g.U 1 , 400 

20 g.U 1 , 500 g.U 1 , 600 g.U 1 , 700 g.U 1 , 800 g.U 1 , 900 g.U 1 , 1000 g.U 1 , or more. 

It is preferred that the desired protein (such as a desired heterologous protein) is 
purified to achieve a pharmaceutically acceptable level of purity. A protein has a 
pharmaceutically acceptable level of purity if it is essentially pyrogen free and can 
25 be administered in a pharmaceutically efficacious amount without causing medical 
effects not associated with the activity of the protein. 

The resulting desired protein (such as a desired heterologous protein) may, or may 
not. be used for any of its known utilities, which, in the case of albumin, include 
30 i.v. administration to patients to treat severe bums, shock and blood loss, 
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supplementing culture media, and as an excipient in formulations of other 
proteins. 

Although it is possible for a therapeutically, diagnostically, industrially, 
5 domestically or nutritionally useful desired protein (such as a desired heterologous 
protein) obtained by a process of the of the invention to.be presented or administered 
alone, it is preferable to present it as a formulation (such as a pharmaceutical 
formulation, particularly in the case of therapeutically and/or diagnostically useful 
proteins), together with one or more acceptable carriers or diluents. The carrier(s) or 
10 diluent(s) must be "acceptable" in the sense of being compatible with the desired 
protein and, where the formulation is intended for administration to a recipient, then 
not deleterious to the recipient thereof. Typically, the carriers or diluents will be 
water or saline which will be sterile and pyrogen free. 

15 Optionally the thus formulated protein will be presented in a unit dosage form, 
such as in the form of a tablet, capsule, injectable solution or the like. 

In a sixth aspect of the present invention there is provided a method for producing 
a desired protein (such as a desired heterologous protein), such as a desired 

20 protein as defined above for an earlier aspect of the present invention, comprising: 
providing a host cell comprising a first recombinant gene encoding the protein 
comprising the sequence of Orm2p or a variant thereof and a second gene, 
optionally a second recombinant gene, encoding a desired protein (such as a 
desired heterologous protein), optionally with the proviso that the first and second 

25 genes are not both present within the host cell on the same 2um-family plasmid; 
and culturing the host cell in a culture medium under conditions that allow the 
expression of the first and second genes. The method may, or may not, further 
comprise the step of purifying the thus expressed desired protein (such as a 
desired heterologous protein) from the cultured host cell or the culture medium; 

30 and optionally, lyophilising the thus purified protein; and optionally formulating 
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the purified desired protein (such as a desired heterologous protein) with a carrier 
or diluent; and optionally presenting the thus formulated protein in a unit dosage 
form. 

5 In the manner discussed above, the host cell may, or may not further comprise a 
further recombinant gene encoding a protein comprising the sequence of an 
alternative chaperone toOrm2p or a variant thereof. 

Either or both of the first and second genes in the sixth aspect of the invention 

10 may or may not be recombinant genes that are expressed from a plasmid, and 
optionally from the same plasmid, provided that where both genes are expressed 
from the same plasmid then that plasmid is not a 2|um -family plasmid. A further 
recombinant gene encoding a protein comprising the sequence of an alternative 
chaperone to Orm2p or a variant thereof may, or may not, also be expressed from 

15 a plasmid, optionally from the same plasmid as either or both of the first and 
second recombinant genes. Except where both of the first and second genes are 
recombinant genes that are co-expressed from the same plasmid then either one 
may, or may not, be individually expressed from a 2|am-family plasmid, such as 
the 2[xm plasmid. Alternatively, one or both of the first and second genes of the 

20 sixth aspect of the invention may, or may not, be integrated into the chromosome 
of the host cell. The further recombinant gene encoding a protein comprising the 
sequence of an alternative chaperone to Orm2p or a variant thereof may, or may 
not, be integrated into the chromosome of the host cell, irrespective of whether or 
not the first and second genes are expressed from a plasmid or are chromosomally 

25 integrated. 

The present invention also provides, in a seventh aspect, a host cell as defined 
above in respect of the sixth aspect, which host cell comprises a first recombinant 
gene encoding a protein comprising the sequence of Orm2p or a variant or 
30 fragment thereof and a second gene, such as a recombinant gene, encoding a 
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desired protein (such as a desired heterologous protein), optionally with the 
proviso that the first and second genes are not present within the host cell on the 
same 2|im-family plasmid. 

5 The present invention also provides, in an eighth aspect, for the use of a nucleic 
acid sequence encoding the protein Orm2p or a variant thereof to increase the 
production, in a host cell (such as a host cell as defined above), of a desired 
protein (such as a desired heterologous protein) encoded by a gene, such as a 
recombinant gene, in the host cell by co-expression of the nucleic acid sequence 

10 and the gene within the host cell (but optionally not including co-expression of 
these genes from the same 2^m-family plasmid). Either or both of the nucleic 
acid sequence and the gene encoding the desired protein may, or may not, be 
expressed from a plasmid within the host cell, and optionally from the same 
plasmid. In the manner discussed above, the host cell may, or may not, further 

15 comprise a recombinant gene encoding an alternative chaperone to Orm2p or a 
variant thereof, which may, or may not, be located on a plasmid within the host 
cell, optionally on the same plasmid as either or both of the nucleic acid sequence 
and a gene encoding the desired protein. Suitable plasmids include a 2|am-family 
plasmid, such as the 2[im plasmid, as discussed above. 

20 

In a ninth aspect of the present invention there is also provided the use of a 
plasmid as an expression vector to increase the production of a heterologous 
protein by providing a recombinant gene encoding the heterologous protein and a 
gene encoding Orm2p or a variant thereof on the same plasmid, optionally with 
25 the proviso that the plasmid is not a 2|am-family plasmid. The plasmid may, or 
may not, further comprise a gene encoding an alternative chaperone to Orm2p or a 
variant thereof in the manner discussed above. 

Accordingly, in a tenth aspect, the present invention also provides a plasmid, 
30 optionally an expression plasmid, comprising a first gene encoding the protein 
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Orm2p or a variant or fragment thereof and a second gene, encoding a 
heterologous protein, as discussed above, optionally with the proviso that the 
plasmid is not a 2um-family plasmid. The plasmid may, or may not, further 
comprise a third gene encoding an alternative chaperone to Orm2p or a variant 
5 thereof. In a preferred embodiment, the third gene encodes a protein comprising 
the sequence of protein disulphide isomerase. 

We have also demonstrated that a plasmid-borne gene encoding a protein 
comprising the sequence of an "essential" protein can be used to stably maintain 
10 the plasmid in a host cell that, in the absence of the plasmid, does not produce the 
"essential" protein. This has the advantage of ensuring the genetic stability of the 
organism in the chosen culture conditions, and thereby improving the 
reproducibility and reliability of individual cultures,- and furthermore enables 
prolonged culture without reduced productivity due to plasmid loss. 

15 

A preferred "essential" protein is a chaperone which may or may not provide the 
further advantage that, as well as acting as a selectable marker to increase plasmid 
stability, its expression simultaneously increases the expression of one or more 
desired proteins, such as a heterologous protein encoded by a recombinant gene, 

20 within the host cell. This system is advantageous because it allows the user to 
minimise the number of recombinant genes that need to be carried by a plasmid. 
For example, typical prior art plasmids carry marker genes (such as those as 
described above) that enable the plasmid to be stably maintained during host cell 
culturing process. Such marker genes need to be retained on the plasmid in 

25 addition to any further genes that are required to achieve a desired effect. 
However, the ability of plasmids to incorporate exogenous DNA sequences is 
limited and it is therefore advantageous to minimise the number of sequence 
insertions required to achieve a desired effect. Moreover, some marker genes 
(such as auxotrophic marker genes) require the culturing process to be conducted 

30 under specific conditions in order to obtain the effect of the marker gene. Such 
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specific conditions may not be optimal for cell growth or protein production, or 
may require inefficient or unduly expensive growth systems to be used. 

Thus, it is possible to use a recombinant gene that encodes a protein comprising 
the sequence of an "essential" protein as a plasmid-borne gene to increase plasmid 
stability, where the plasmid is present within a cell that, in the absence of the 
plasmid, is unable to produce the "essential" protein. It will be appreciated that 
the question of whether or not a protein is "essential" will depend on the system in 
which it is use; it is possible that a protein that is not "essential" in one host 
organism might become "essential" when one or more other genes is deleted, 
disrupted, inactivated, modified or affected in that same host, and thereby be used 
as an "essential" plasmid-borne gene, as described above; likewise whether or not 
a protein is "essential" may depend on certain physical conditions, such as pH, 
temperature and/or oxygen levels under which the host cell is cultured- 
It is preferred that the "essential protein" is one that, when its encoding gene(s) in a 
host cell are deleted or inactivated, does not result in the host cell developing an 
auxotrophic (biosynthetic) requirement. By "auxotrophic (biosynthetic) 
requirement" we include a deficiency that can be complemented by additions or 
modifications to the growth medium, in particular additions of, or modifications to, 
the nutrient composition of the growth medium. Thus, the "essential protein" 
would be an auxotrophic marker protein if the inactivation of its encoding gene, in 
a host cell, resulted in the production of an auxotrophic mutant, i.e. a mutant 
organism that, in order to grow and survive, requires a particular additional nutrient 
that the normal (unmutated strain) does not - it is preferred that the "essential 
protein" is not an auxotrophic marker protein. Therefore, an "essential marker 
gene" which encodes an "essential protein", in the context of the present invention 
is one that, when deleted or inactivated in a host cell, results in a deficiency which 
cannot be complemented by additions or modifications, typically nutrient additions 
or modifications, to the growth medium, expect where those additions or 

modifications are, for example, a polynucleotide, that can restore the ability of the 
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host cell to express the product of the -essential' 1 marker gene, or the product of the 
"essential" marker gene itself. In other words, it may, or may not, be preferred if 
the "essential protein" is not a protein that, in nature, is involved in the metabolic 
conversion of nutrients by a host cell. The advantage of this system is that the 
"essential marker gene" can be used as a selectable marker on a plasmid in host cell 
that, in the absence of the plasmid, is unable to produce that gene product, to 
achieve increased plasmid stability without the disadvantage of requiring the cell to 
be cultured under specific selective (e.g. selective nutrient) conditions. Therefore, 
the host cell can be cultured under conditions that do not have to be adapted for any 
particular marker gene, without losing plasmid stability. For example, host cells 
produced using this system can be cultured in non-selective media such as complex 
or rich media, and under non-selective growth conditions (e.g. such as pH, 
temperature and/or oxygen levels), which may be more economical, and/or more 
supportive growth media/conditions, than the minimal media and/or specifically 
adapted growth conditions that are commonly used to give auxotrophic, and other, 
marker genes their effect. 

The cell may, or may not, for example, have the endogenous copy (or copies) of the 
gene (or genes) encoding the "essential" protein deleted or otherwise inactivated. 

It is particularly preferred if the "essential protein" is an "essential" chaperone, as 
this can provide the dual advantage of improving plasmid stability without the need 
for selective growth conditions and increasing the production of desired proteins, 
such as endogenously encoded or a heterologous proteins, in the host cell. This 
system also has the advantage that it minimises the number of recombinant genes 
that need to be carried by the plasmid if one chooses to use over-expression of an 
"essential" chaperone to increase protein production by the host cell. 

Preferred "essential proteins" for use in this aspect of the invention include the 
"essential" chaperones encoded by the genes PDI1 and PSE1 which, when the 
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endogenous gene(s) encoding these proteins are deleted or inactivated in a host cell, 
do not result in the host cell developing an auxotrophic (biosynthetic) requirement. 

Preferred "essential" chaperones are eucaryotic chaperones, especially preferred 
"essential" chaperones are yeast chaperones, including chaperones comprising the 
sequence of proteins encoded by a gene selected from CCT2, CCT3, CCT4, CCT5, 
CCT6, CCT7, CCT8, CNSJ, EROl (in the absence of diamide), HSP10, HSP60, 
PDJ1, CDC37, KAR2, MGE1, MRS 11, NOBJ, SSCJ, PSE1, TIM9, PAM18 and 
TCP1. 

It is noted that a host cell that is mutated to inactive EROl can be complemented 
by growth in the presence of the oxidant diamide (Frand & Kaiser, 1998, 
Molecular Cell, 1, 161-170), but diamide is not a "nutrient" addition of the type 
discussed above in respect of auxotrophic mutations. Diamide is not a commonly 
used component of growth media and an EROl mutant that is transformed with a 
plasmid comprising the EROl gene can be grown in rich media without loss of 
plasmid stability. 

Accordingly, in an eleventh aspect, the present invention also provides a host cell 
comprising a plasmid (such as a plasmid as defined above by any of the previous 
aspects of the invention), the plasmid comprising a gene that encodes an 
"essential" protein, such as a chaperone, wherein, in the absence of the plasmid, 
the host cell is unable to produce the "essential" protein. Preferably, in the 
absence of the plasmid, the host cell is inviable. Typically the host cell has been 
genetically modified to render it unable to produce a functional copy of the 
"essential" protein from a chromosomally-encoded (or otherwise endogenous) 
gene. The host cell may, or may not, further comprise a recombinant gene 
encoding a heterologous protein, such as those described above in respect of 
earlier aspects of the invention. 
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The present invention also provides, in a twelfth aspect, a plasmid comprising, as 
the sole yeast selectable marker, optionally as the sole selectable marker, a gene 
encoding an "essential" protein, such as an "essential" chaperone. The plasmid 
may, or may not, further comprise a gene encoding a heterologous protein. The 
5 plasmid may, or may not, be a 2 urn-family plasmid. 

The present invention also provides, in a thirteenth aspect, a method for producing 
a desired protein (such as a desired heterologous protein) comprising the steps of: 
providing a host cell comprising a plasmid, the plasmid comprising a gene that 

10 encodes an "essential" protein, such as a chaperone, wherein, in the absence of the 
plasmid, the host cell is unable to produce the "essential" protein and wherein the 
host cell further comprises a gene, such as a recombinant gene, encoding a desired 
protein (such as a desired heterologous protein); culturing the host cell in a 
culture medium under conditions that allow the expression of the "essential" 

15 protein and the desired protein; and optionally purifying the thus expressed 
desired protein from the cultured host cell or the culture medium; and further 
optionally, lyophilising the thus purified protein. Thus, a host cell used in this 
method may, or may not, be a host cell according to the eleventh aspect of the 
invention and/or the host call may, or may not, be transformed with a plasmid 

20 according to the twelfth aspect of the invention. 

The method may, or may not, further comprise the step of formulating the purified 
desired protein (such as a desired heterologous protein) with a carrier or diluent 
and optionally presenting the thus formulated protein in a unit dosage form, in the 
25 manner discussed above. 

In one preferred embodiment, the step of "culturing the host cell in a culture 
medium under conditions that allow the expression of the "essential" protein and 
the desired protein" involves culturing the host cell in medium that is not 
30 specifically adapted to be selective for the presence of any genes on the plasmid. 
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other than for the presence of the gene encoding the "essential" protein. Thus, in 
one embodiment, the step of culturing the host cells may, or may not, be 
performed in non-selective media, such as complex or rich media and/or under 
conditions (such as pH, temperature and/or oxygen levels) that are not specifically 
adapted to select for the presence of the "essential" protein. A medium can be 
described as non-selective for the purposes of the present situation if it is not 
specifically adapted to deprive the host cell of a product, typically a nutrient 
product, that is ordinarily provided to maximise, or otherwise allow, the growth of 
host cells that have not been modified to prevent the expression of the "essential" 
protein. For example, a medium (the "test medium") may be a non-selective 
medium, for the purposes of the present invention if, when one compares plasmid 
stability in a first host cell type grown in the "test medium" to plasmid stability in 
a second cell type grown in the "test medium" when each cell type is grown for 5, 
10, 15, 20, 25 or 30 generations in the "test medium", wherein - 

(i) the first host cell type is a host cell according to the eleventh aspect of the 
present invention; 

(ii) the second host cell type is a host cell according to the eleventh aspect of 
the present invention except that it has been modified to restore the ability 
of the host cell to produce the "essential" protein in the absence of the 
plasmid (which is not to say that the second host cell type does not contain 
a plasmid encoding the "essential" protein, just that it can produce the 
"essential" protein even when the plasmid is not present); 

then the plasmid stability observed in the second cell type is less than 99%, 98%, 
95%, 90%, 80%, 70%, 60%, 50%, 40%, 30%, 20%, 15%, 10%, 5%, 4%, 3%, 2%, 
1% or less of the plasmid stability observed in the first cell type. Thus, in this 
embodiment, and despite being grown under non-selective conditions, it is 
preferable for method of the thirteenth aspect of the present invention to produce 
host cells which display substantially 100% plasmid stability after 5, 10, 15, 20, 
25 or 30 generations. 
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A fourteenth aspect of the present invention also provides for the use of a 
polynucleotide that encodes an "essential" protein (as defined above) to increase 
the stability of a plasmid in a host cell, particularly under non-selective conditions, 
by integration of the polynucleotide into the plasmid to produce a modified 
plasmid, wherein the host cell is unable to produce the "essential" protein in the 
absence of the modified plasmid. The increase in stability may, or may not, be at 
least 1% (i.e. 1.01 times), 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 20%, 30%, 
40%, 50%, 60%, 70%, 80%, 90%, 100% (i.e. 2-fold), 3-fold, 4-fold, 5-fold, 6-fold 
s, 7-fold, 8-fold, 9-fold, 10-fold or more greater than the level of stability of the 
unmodified plasmid in the same host cell that has been modified to produce the 
"essential" protein, when grown for at least five generations under non-selective 
conditions. 

In one embodiment of the fourteenth aspect of the present invention, the plasmid 
may, or may not, additionally comprise a gene that encodes a further desired 
heterologous protein, such as defined above in respect of earlier aspects of the 
present invention. In that case, the use may, or may not, be to improve the 
productivity of the desired heterologous protein, such as when the host cell 
comprising the plasmid is grown under non-selective conditions. 

In one preferred embodiment, the "essential" protein is a chaperone and may, or 
may not, be used to simultaneously increase the stability of a plasmid in the host 
cell and increase the ability of the host cell to produce a desired protein product. 
The desired protein product may, or may not, be an endogenously encoded protein 
or may, or may not, be a heterologous protein as defined by the earlier aspects of 
the invention, mere the protein product is a heterologous protein, it may, or may 
not, be encoded by a recombinant gene that has been integrated into the 
chromosome of the host cell, or by a gene that is present on a plasmid in the host 
cell, such as the modified plasmid comprising the polynucleotide that encodes the 
"essential" protein as defined above. 
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We have also found that the effects of recombinantly-provided chaperones 
according to the other embodiments of the present invention can be modulated by 
modifying the promoters that control the expression levels of the chaperone(s). 
Surprisingly we have found that, in some cases, shorter promoters result in 

5 increased expression of a desired protein. Without being bound by theory we 
believe that this is because the expression of a recombinant chaperone in host cells 
that already express desired proteins at high levels can cause the cells to overload 
themselves with desired protein (such as a desired heterologous protein), thereby 
achieving little or no overall increase in production of the desired protein. In 

10 those cases, it may, or may not, be beneficial to provide recombinant chaperone 
genes with truncated promoters. 

Accordingly, in a fifteenth aspect of the present invention there is provided a 
polynucleotide (such as a plasmid as defined above) comprising the sequence of a 

15 promoter operably connected to a coding sequence encoding a chaperone (such as 
those described above), for use in increasing the expression of a desired protein 
(such as a desired heterologous protein), such as those described above, in a host 
cell (such as those described above) by expression of the polynucleotide sequence 
within the host cell, wherein the promoter is characterised in that it achieves a 

20 modified, such as a higher or lower, level of expression of the chaperone than 
would be achieved if the coding sequence were to be operably connected to its 
naturally occurring promoter. 

The present invention also provides, in a sixteenth aspect, a method for producing 

25 a desired protein (such as a desired heterologous protein) comprising the steps of: 

providing a host cell comprising a recombinant gene that comprising the sequence 

of promoter operably connected to a coding sequence encoding a chaperone, the 

promoter being characterised in that it achieves a lower level of expression of the 

chaperone than would be achieved if the coding sequence were to be operably 

30 connected to its naturally occurring promoter, and the host cell further comprising 

a gene, such as a recombinant gene, encoding a desired protein (such as a desired 
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heterologous protein); culturing the host cell under conditions that allow the 
expression of the chaperone and the desired protein; and optionally purifying the 
thus expressed desired protein from the cultured host cell or the culture medium; 
and further optionally, lyophilising the thus purified protein; and optionally further 
formulating the purified desired protein with a carrier or diluent; and optionally 
presenting the thus formulated protein in a unit dosage form, in the manner 
discussed above. 

As is apparent from the examples of the present application, the combination of 
recombinantly expressed PDI and transferrin-based proteins provides a 
surprisingly high level of transferrin expression. For example, transferrin 
expression in a system that includes a chromosomally encoded recombinant PDI 
gene provided a 2-fold increase (compared to a control in which there is no 
chromosomally encoded recombinant PDI gene). This increase was 5-times 
greater than an equivalent system comprising a recombinant gene encoding human 
albumin in place of the recombinant transferrin gene. 

The host may be any cell type, such as a procaryotic cell (e.g. bacterial cells such 
as E. coli) or a eucaryotic cell. Preferred eucaryotic cells include fungal cells, 
such as yeast cells, and mammalian cells. Exemplar}' yeast cells are discussed 
above. Exemplary mammalian cells include human cells. 

Host cells as described above can be cultured to produce recombinant transferrin- 
based proteins. The thus produced transferrin-based proteins can be isolated from 
the culture and purified, optionally to a pharmaceutically acceptable level of 
purity, for example using techniques known in the art and/or as set out above. 
Purified transferrin-based proteins may, or may not, be formulated with a 
pharmaceutically acceptable carrier or diluent and may, or may not, be presented 
in unit dosage form. 
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The present invention will now be exemplified with reference to the following 
non-limiting examples and figures. 

BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 shows a map of a typical 2um plasmid. 

Figure 2 shows the results of rocket immunoelectrophoresis (RIE) determination 
of increased recombinant transferrin (N413Q, N611Q) secretion with PDI1 over- 
expression. Cryopreserved yeast stocks were grown for 4-days in lOmL BMMD 
shake flask cultures and supernatants were loaded at 5uL per well. Goat 
polyclonal anti-transferrin (human) antiserum (Calbiochem) was used at 40uL per 
rocket immunoelectrophoresis gel (50mL). A = Control strain [pSAC35], 
duplicate flasks: B = Control strain [pDB2536], duplicate flasks; C = Control 
strain [pDB2711], neat to 40-fold aqueous dilutions; D = Control strain 
[pDB2931], duplicate flasks; E = Control strain [pDB2929], neat to 40-fold 
aqueous dilutions. 

Figure 3 shows the results of RIE analysis of recombinant transferrin (N413Q, 
N611Q) secretion with and without PDI1 over-expression. Cryopreserved yeast 
stocks were grown for 4-days in lOmL BMMD shake flask cultures and 
supernatants were loaded at 5uL per well. Duplicate loadings were made of 
supernatants from two individual cultures of each strain. Goat polyclonal anti- 
transferrin (human) antiserum (Calbiochem) was used at 40uL per rocket 
immunoelectrophoresis gel (50mL). A = Control strain [pSAC35]; B = Control 
strain [pDB2536]; C = Control strain [pDB2711]; D = Control strain [pDB2931]; 
E = Control strain [pDB2929]. 

Figure 4 shows the results of SDS-PAGE analysis of recombinant transferrin 
secretion with and without PDU over-expression. BMMD shake flask cultures 
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were grown for 4-days and 10uL supernatant analysed on non-reducing SDS- 
PAGE (4-12% ttuPAGE®, MOPS buffer, InVitrogen) with GelCode® Blue 
reagent (Pierce). 1 = SeeBlue Plus2 Markers (InVitrogen). 2 = pDB2536; 3 = 
pDB2536: 4 = pDB2711; 5 = pDB2711; 6 = pDB2931; 7 = pDB2931; 8 = 
5 P DB2929; 9 = pDB2929; 1 0 = pSAC35 control. 

Figure 5 shows PJE analysis of recombinant transferrin secretion from 5. 
cerevisiae strains with an additional integrated copy of PDI1. 5-day BMMD shake 
flask culture supematants were loaded at 5mL per well. Strains contained: 1) 

10 pSAC35 (negative control); 2) pDB2536 (recombinant non-glycosylated 
transferrin (N413Q, N611Q)) or 3) pDB2506 (same as plasmid pDB2536 but the 
transferrin ORF encodes transferrin without the N-+Q mutations at positions 413 
and 61 1, i.e. recombinant glycosylated transferrin). Each well contained a sample 
derived from an individual transformant. Standards were human plasma holo- 

15 transferrin (Calbiochem) at 100, 50, 20, 10, 5 and 2mg.L" 1 . 

Figure 6 shows RIE analysis of recombinant transferrin secretion from Strain A 
[pDB2536] and Strain A [pDB2506] grown in shake flask culture. 5-day BMMD 
or YEPD shake flask culture supematants were loaded in duplicate at 5mL per 
20 well. 

Figure 7 shows SDS-PAGE analysis of recombinant transferrin secreted from 
Strain A [pDB2536] and Strain A [pDB2506] grown in shake flask culture. 
Cultures were grown for 5-days in BMMD and 30mL supematants analysed on 
SDS-PAGE (4-12% NuPAGE™, MOPS Buffer, InVitrogen) stained with 
GelCode, Blue Reagent (Pierce). 1) Strain A [pDB2536] transformant 1 ; 2) Strain 
A [pDB2536] transformant 2; 3) Strain A [pSAC35] control; 4) Strain A 
[pDB2506] transformant 1; 5) SeeBlue, Plus2 Protein Standards (approximate 
molecular weights only). 



25 



30 
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Figure 8 shows RIE of recombinant transferrin secreted from S. cerevisiae strains 
with different PD11 copy numbers. 3-day BMMD shake flask culture 
supernatants were loaded at 5mL per well. Goat polyclonal anti-transferrin 
(human) antiserum (Calbiochem) was used at 30mL per rocket 
5 Immunoelectrophoresis gel (50mL). (A) supernatant from S. cerevisiae control 
strain [pDB2711] or [pDB2712]; (B) supernatant from Strain A [pDB2536]; (C) 
supernatant from control strain [pDB2536]. 

Figure 9 shows SDS-PAGE analysis of recombinant transferrin secreted from S. 

10 cerevisiae strains with different PDI1 copy numbers. 4-12% NuPAGE reducing 
gel run with MOPS buffer (InVitrogen) after loading with 30mL of 3-day BMMD 
shake flask culture supernatant per lane; (lane 1) supernatant from control strain 
[pDB2536]; (lane 2) supernatant from Strain A [pDB2536]; (lanes 3-6) 
supernatant from control strain [pDB2711] or [pDB2712]; (lane 7) molecular 

1 5 weight markers (SeeBlue Plus2, InVitrogen). 

Figure 10 shows RIE of recombinant transferrin secreted from different S. 
cerevisiae strains with and without additional PDI1 gene co-expression. lOmL 
YEPD shake flasks were inoculated with yeast and incubated for 4-days at 30°C. 
20 5uL culture supernatant loaded per well of a rocket immunoelectrophoresis gel. 
Plasma Tf standards concentrations are in u.g/mL. 20uL goat anti-Tf / 50mL 
agragose. Precipin was stained with Coomassie blue. 

Figure 11 shows RIE analysis of rHA expression in different S. cerevisiae strains 
25 when co-expressed with PDI1 genes having different length promoters. lOmL 

YEPD shake flasks were inoculated with yeast and incubated for 4-days at 30°C. 

4uL culture supernatant loaded per well of a rocket immunoelectrophoresis gel. 

rHA standards concentrations are in ug/mL, 400uL goat anti-HA (Sigma product 

A-1151 resuspended in 5mL water) /50mL agarose. Precipin was stained with 
30 Coomassie blue. 
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Figure 12 shows RIE analysis of rHA expression in different S. cerevisiae strains 
when co-expressed with PDJ1 genes having different length promoters. lOmL 
YEPD shake flasks were inoculated with yeast and incubated for 4-days at 30°C. 
4uL culture supernatant loaded per well of a rocket Immunoelectrophoresis gel. 
rHA standards concentrations are in ug/mL. 400uL goat anti-HA (Sigma product 
A-1151 resuspended in 5mL water) /50mL agarose. Precipin was stained with 
Coomassie blue. 

Figure 13 shows RIE analysis of rHA fusion proteins with and without co- 
expressed recombinant PDI1. lOmL BMMD shake flasks were inoculated with 
YBX7 transformed with albumin fusion expression plasmids and incubated for 4- 
days at 30°C. 4uL culture supernatant loaded per well of a rocket 
Immunoelectrophoresis gel. rHA standards concentrations are in ug/mL. 200uL 
goat anti-HA (Sigma product A-1151 resuspended in 5mL water) /50mL agarose. 
Precipin was stained with Coomassie blue. 



Figure 14 shows SDS-PAGE analysis of recombinant albumin fusion secretion 
with and without PDI1 present on the expression plasmid. lOmL BMMD shake 
flasks were inoculated with yeast and incubated for 4-days at 30°C, 200rpm. 30uL 
supernatant analysed on non-reducing SDS-PAGE (4-12% NuPAGE®, MES 
buffer, InVitrogen) with GelCode® Blue reagent (Pierce). 1 ■= SeeBlue Plus2 
Markers (InVitrogen); 2 = lug rHA; 3 = angiostatin-rHA; 4 = angiostatin-rHA + 
PDlh 5 = endostatin-rHA; 6 = endostatin-rHA + PDJ1; 7 = DX-890-(GGS) 4 GG- 
25 rHA; 8 = DX-890-(GGS) 4 GG-rHA + PDI1; 9 = DPI-14-(GGS) 4 GG-rHA; 10 = 
DPI-]4-(GGS) 4 GG-rHA + PDI1; 11 = Axokine™ (CNTF Axl5 )-(GGS) 4 GG-rHA 
(Lambert et al, 2001, Proc. Natl. Acad. Sci. USA, 98, 4652-4657); 12 = Axokine™ 
(CNTFaxI s) -(GGS) 4 GG-rHA + PDI1 . 
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Figure 15 shows RIE analysis demonstrating increased transferrin secretion from 
S. cerevisiae with ORM2 co-expression from a 2um-based plasmid. Four day 
shake flask culture supernantants were loaded at 5ul per well. Standards were 
human plasma holo-transferrin (Calbiochem), at 25, 20, 15, 10, 5 ug/ml, loaded 
5ul per well. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem) 
used at 20 ul per rocket immunoelectrophoresis gel (50 ml). 

Figure 16 shows RIE analysis demonstrating increased transferrin secretion from 
S. ^ cerevisiae with PSE1 co-expression from a 2um-based plasmid. Four day 
shake flask culture supernantants were loaded at 5ul per well. Standards were 
human plasma holo-transferrin (Calbiochem), at 25, 20, 15, 10, 5 ug/ml, loaded 
5ul per well. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem) 
used at 20ul per rocket immunoelectrophoresis gel (50 ml). 

Figure 17 shows RIE analysis demonstrating increased transferrin secretion from 
S. "cerevisiae with SSA1 co-expression from a 2um-based plasmid. Four day shake 
flask culture supernantants were loaded at 5ul per well. Standards were human 
plasma holo-transferrin (Calbiochem), at 25, 20, 15, 10, 5 ug/ml, loaded 5ul per 
well. Goat polyclonal anti-transferrin (human) antiserum (Calbiochem) used at 
20ul per rocket immunoelectrophoresis gel (50 ml). 

Figure 18 shows the results of RIE. lOmL YEPD shake flasks were inoculated 
with DXY1 trplA [ P DB2976], DXY1 trplA [pDB2977], DXY1 trplA 
[pDB2978], DXY1 trplA [pDB2979], DXY1 trplA [pDB2980] or DXY1 trplA 
[pDB2981] transformed to tryptophan prototrophy with a 1.41kb NotVPstl 
pdil::TRPl disrupting DNA fragment was isolated from pDB3078. 
Transformants were grown for 4-days at 30°C, 200rpm. 4uL culture supernatant 
loaded per well of a rocket immunoelectrophoresis gel. rHA standards 
concentrations are in pg/mL. 700pL goat anti-HA (Sigma product A-1151 
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resuspended in 5mL water) /50mL agarose. Precipin was stained with Coomassie 
blue. Isolates selected for further analysis are indicated (*). 

Figure 19 shows the results of RIE. lOmL YEPD shake flasks were inoculated 
5 with DXY1 [pDB2244], DXY1 [pDB2976], DXY1 trplA pdil::TRPl 
[pDB2976], DXY1 [pDB2978], DXY1 trplA pdilr.TRPl [pDB2978], DXY1 
[pDB2980], DXY1 trplA pdilr.TRPl [pDB2980], DXY1 [pDB2977], DXY1 
trplA pdilr.TRPl [pDB2977], DXY1 [pDB2979] DXY1 trplA pdilr.TRPl 
[pDB2979], DXY1 [pDB2981] and DXY1 trplA pdilr.TRPl [pDB2981], and 
10 were grown for 4-days at 30°C, 200rpm. 4uL culture supernatant loaded per well 
of a rocket Immunoelectrophoresis gel. rHA standards concentrations are in 
|ag/mL. 800uL goat anti-HA (Sigma product A-l 151 resuspended in 5mL water) 
/50mL agarose. Precipin was stained with Coomassie blue. Isolates selected for 
further analysis are indicated (*) 



15 



20 



Figure 20 shows a sequence alignment of the SKQ2n and S288c gene sequences 
with long promoters, as described in Example 6. 

Figures 21 to 33 show various plasmid maps. 



Figure 34 shows Rocket Immunoelectrophoresis of YEPD shake flask culture 
supernatants from DXY1 and DXY1 Atipl pdilr.TRPl containing pDB3175 to 
pDB3178. lOmL YEPD shake flasks were inoculated with DXY1 [pAYE316], 
DXY1 [pDB3175], DXY1 [pDB3176], DXY1 [pDB3177], DXY1 [pDB3178], 
25 DXY1 Atrpl pdilr.TRPl [pDB3175], DXY1 Atrpl pdilr.TRPl [pDB3176], 
DXY1 Atipl pdilr.TRPl [pDB3177], and DXY1 Atrpl pdilr.TRPl [pDB3178] 
were grown for 4-days at 30°C, 200rpm. 5uL culture supernatant was loaded per 
well of a 50mL rocket Immunoelectrophoresis gel in triplicate. rHA standard 
concentrations were 300, 200, 150, 100 and 50 ug/mL. 600uL goat anti-HSA 
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(Sigma product A-1151 resuspended in 5mL water) per 50mL agarose gel. 
Precipitin was stained with Coomassie blue. 

Figure 35 shows Rocket Immunoelectrophoresis of YEPD shake flask culture 
supernatants from DXY1 and DXY1 Atrpl pdil::TRPl containing pDB3179 to 
pDB3182. lOmL YEPD shake flasks were inoculated with DXY1 [pDB2931], 
DXY1 [pDB3179], DXY1 [pDB3180], DXY1 [pDB3181], DXY1 [pDB3182], 
DXY1 Atrpl pdilr.TRPl [pDB3179], DXY1 Atrpl pdilr.TRPl [pDB3180], 
DXY1 Atrpl pdilr.TRPl [pDB3181], and DXY1 Atrpl pdilr.TRPl [pDB3182] 
were grown for 4-days at 30°C, 200rpm. 5uL culture supernatant was loaded per 
well of a 50mL rocket immunoelectrophoresis gel in triplicate. Plasma transferrin 
standard concentrations were 100, 50, 20, 10 and 5 ug/mL. 40uL goat polyclonal 
anti human transferrin antiserum (Calbiochem) was used per 50mL agarose gel. 
Precipitin was stained with Coomassie blue. 

EXAMPLES 

Two types of expression cassette have been used to exemplify secretion of a 
recombinant human transferrin mutant (N413Q, N61 1Q) from S. cerevisiae. One 
type uses a modified HSA(pre)/MFal(pro) leader sequence (named the "modified 
fusion leader" sequence). The second type of expression cassette uses only the 
modified HSA(pre) leader sequence. 

The 24 amino acid sequence of the "modified fusion leader" is 
MKWVFIVSILFLFSSAYSRSLDKR. 

The 18 amino acid sequence of the modified HSA(pre) leader sequence is 
MKWVFIVSILFLFSSAYS. 
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Transferrin (N413Q, N611Q) expression using these two cassettes has been 
studied in S. cerevisiae using the 2pm expression vector with and without an 
additional copy of the S. cerevisiae PD1 gene, PD11. 

EXAMPLE 1 

Construction of expression plasmids 

Plasmids pDB2515, pDB2529, pDB2536, pDB2688, pDB2690, pDB2711, 
P DB2921, pDB2928, pDB2929, pDB2930, pDB2931, pDB2932 and pDB2690 
were constructed as described in Example 1 of WO 2005/061718, the contents of 
which are incorporated herein by reference. 

EXAMPLE 2 
Expression of transferrin 

A S. cerevisiae control strain was transformed to leucine prototrophy with all the 
transferrin (N413Q, N61 1Q) expression plasmids, and cryopreserved stocks were 
prepared. 

Strains were grown for four days at 30°C in lOmL BMMD cultures in 50mL 
conical flasks shaken at 200rpm. The litres of recombinant transferrin secreted 
into the culture supernatants were compared by rocket immunoelectrophoresis 
(RIE as described in Weeke, B., 1976, "Rocket immunoelectrophoresis" In N. H. 
Azelsen, J. Kroll, and B. Weeke [eds.], A manual of quantitative 
immunoelectrophoresis. Methods and applications, Universitetsforlaget, Oslo, 
Norway), reverse phase high performance liquid chromatography (RP-HPLC) 
(Table 1), and non-reducing SDS polyacrylamide electrophoresis stained with 
colloidal Coomassie blue stain (SDS-PAGE). The increase in recombinant 
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transferrin secreted when S. cerevisiae PDJJ was over-expressed was estimated to 
be greater than 10-fold. 

By RP-HPLC analysis (using the method described in Example 2 of WO 
2005/061718, the contents of which are incorporated herein by reference) the 
increase in transferrin secretion was determined to be 18-fold for the modified 
fusion leader sequence and 15-fold for the modified HSA-pre leader sequence 
(Table 1). 

Figure 4 shows an SDS-PAGE comparison of the recombinant transferrin secreted 
by S. cerevisiae strains with and without additional PDI1 expression. 
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Table 1: 



Plasmid 


Secretory' 
Leader 


A /4 rl iti f\t\ 51 1 

PDll 


Average Transferrin Titre 
(^ig.mL- 1 ) (n=2) 


Estimated Increase due to 
Additional PDll 


pSAC35 


None 


No 


0.4 




pDB2536 


Fusion 
Leader 


No 


6.2 




pDB2711 


Fusion 
Leader 


Yes 


112.8 


18-fold 


pDB293 1 


Modified 
HSA-pre 
Leader 


No 


5.1 




pDB2929 


Modified 
HSA-pre 
Leader 


Yes 


| 76.1 


15- fold 



RIE analysis indicated that the increased transferrin secretion in the presence of 
5 additional copies of PDll was approximately 15-fold (Figure 2). By RIE analysis 
the increase appeared slightly larger for the modified HSA-pre leader sequence 
than for the modified fusion leader sequence (Figure 3). 
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EXAMPLE 3 



Chromosomal over-expression ofPDI 



S. cerevisiae Strain A was selected to investigate the secretion of recombinant 
glycosylated transferrin expression from plasmid pDB2506 and recombinant non- 
glycosylated transferrin (N413Q, N611Q) from plasmid pDB2536. Strain A has 
the following characteristics - 

additional chromosomally integrated PDU gene integrated at the host 
PDI1 chromosomal location. 

the URA3 gene and bacterial DNA sequences containing the ampicillin 
resistance gene were also integrated into the S. cerevisiae genome at the 
insertion sites for the above genes. 



A control strain had none of the above insertions. 



Control strain [cir°] and Strain A [cir°] were transformed to leucine prototrophy 
with pDB2506 (recombinant transferrin), pDB2536 (recombinant non- 
glycosylated transferrin (N413Q,. N611Q)) or pSAC35 (control). Transformants 
were selected on BMMD-agar. 

The relative level of transferrin secretion in BMMD shake flask culture was 
determined for each strain/plasmid combination by rocket Immunoelectrophoresis 
(RIE). Figure 5 shows that both strains secreted both the glycosylated and non- 
glycosylated recombinant transferrins into the culture supernatant. 

The levels of both the glycosylated and non-glycosylated transferrins secreted 
from Strain A [pDB2506] and Strain A [pDB2536] respectively, appeared higher 
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than the levels secreted from the control strain. Hence, at least in shake flask 
culture, PDU integrated into the host genome at the PDU locus in Strain A has 
enhanced transferrin secretion. 

Furthermore, the increase in transferrin secretion observed between control strain 
[pDB2536] and Strain A [pDB2536] appeared'to be at least a 100% increase by 
RIE. In contrast, the increase in rHA monomer secretion between control strain 
[pDB2305] and Strain A [pDB2305] was approximately 20% (data not shown). 
Therefore, the increase in transferrin secretion due to the additional copy of PDU 
in Strain A was surprising large considering that transferrin has 19 disulphide 
bonds, compared to rHA with 17 disulphide bonds. Additional copies of the PDU 
gene may be particularly beneficial for the secretion from S. cerevisiae of proteins 
from the transferrin family, and their derivatives. 

The levels of transferrin secreted from Strain A [pDB2536] and Strain A 
[pDB2506] were compared by RIE for transformants grown in BMMD and YEPD 
(Figure 6). Results indicated that a greater than 2-fold increase in titres of both 
non-glycosylated recombinant transferrin (N413Q, N611Q) and glycosylated 
recombinant transferrin was achieved by growth in YEPD (10-20 mg.L" 1 serum 
transferrin equivalent) compared to BMMD (2-5 mg.L" 1 serum transferrin 
equivalent). The increase in both glycosylated and non-glycosylated transferrin 
titre observed in YEPD suggested that both transferrin expression plasmids were 
sufficiently stable under non-selective growth conditions to allow the expected 
increased biomass which usually results from growth in YEPD .to be translated 
into increased glycosylated and non-glycosylated transferrin productivity. 

SDS-PAGE analysis of non-glycosylated transferrin (N413Q, N611Q) secreted 
from Strain A [pDB2536] and glycosylated transferrin from Strain A [pDB2506] 
grown in BMMD shake flask culture is shown in Figure 7. Strain A [pDB2536] 
samples clearly showed an additional protein band compared to the Strain A 
IpSAC35] control. This extra band migrated at the expected position for the 

111 



WO 2006/067511 



PCT/GB2005/005085 



recombinant transferrin (N413Q, N611Q) secreted from control strain [pDB2536]. 
Strain A [pDB2506] culture supematants appeared to contain a diffuse protein 
band at the position expected for transferrin. This suggested that the secreted 
recombinant transferrin was heterogeneous, possibly due to hyper-mannosylation 
5 at Asp4 1 3 and/or Asp6 1 1 . 



EXAMPLE 4 

Comparing transferrin secretion from S. cerevisiae control strain containing 
10 pDB2711 with transferrin secretion from S. cerevisiae Strain A 

Plasmid pDB2711 is as described above. Plasmid pDB2712 (Figure 22 of WO 
2005/06171 8) was also produced with the Notl cassette in the opposite direction to 
pDB2711. 

15 

Control strain S. cerevisiae [cir°] was transformed to leucine prototrophy with 
pDB2711 and pDB2712. Transformants were selected on BMMD-agar and 
cryopreserved trehalose stocks of control strain [pDB271 1] were prepared. 

20 Secretion of recombinant transferrin (N413Q, N611Q) by control strain 
[pDB2711], control strain [pDB2712], Strain A [pDB2536], control strain 
[pDB2536] and an alternative control strain [pDB2536] was compared in both 
BMMD and YEPD shake flask culture. RIE indicated that a significant increase 
in recombinant transferrin secretion had been achieved from control strain 

25 [pDB271 1 ] with multiple episomal PDI1 copies, compared to Strain A [pDB2536] 
with two chromosomal copies of PDIJ, and control strain [pDB2536] with a 
single chromosomal copy of PDI1 gene (Figure 8). Control strain [pDB2711] and 
control strain [pDB2712] appeared to secrete similar levels of rTf (N413Q, 
N61 1Q) into the culture media. The levels of secretion were relatively consistent 

30 between control strain [pDB271 1] and control strain [pDB2712] transformants in 
both BMMD and YEPD media, suggesting that plasmid stability was sufficient for 
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high-level transferrin secretion even under non-selective conditions. This is in 
contrast to the previous published data in relation to recombinant PDGF-BB and 
HSA where introduction of PDI1 into multicopy 2um plasmids was shown to be 
detrimental to the host. 

Table 2: Recombinant transferrin titres from high cell density fermentations 



Strain 


Supernatant (g.L 1 ) 


GP-HPLC 


SDS-PAGE 


Control [pDB2536] 


0.5/0.4 




Alternative control [pDB2536] 


1.5/1.6 


0.6 


0.9/0.9 


0.4/0.4/0.5 


Strain A [pDB2536] 


0.7 


0.6 




0.6 




Control [pDB2711] 


3.5 


3.6 


3.4 


2.7/3.1 



Reducing SDS-PAGE analysis of transferrin secreted from control strain 
[pDB2711], control strain [pDB2712], Strain A [pDB2536], control strain 
[pDB2536] and alternative, control strain [pDB2536] in BMMD shake flask 
culture is shown in Figure 9. This shows an abundant protein band in all samples 
from control strain [pDB2711] and control strain [pDB2712] at the position 
expected for transferrin (N413Q, N611Q). The relative stain intensity of the 
transferrin (N413Q, N611Q) band from the different strains suggested that Strain 
A [pDB2536] produced more than control strain [pDB2536] and alternative 
control strain [pDB2536], but that there was an even more dramatic increase in 
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secretion from control strain [pDB2711] and control strain [pDB2712]. The 
increased recombinant transferrin secretion observed was concomitant with the 
increased PDI1 copy number in these strains. This suggested that Pdilp levels 
were limiting transferrin secretion in control strain, Strain A and the alternative 
control strain, and that elevated PDI1 copy number was responsible for increased 
transferrin secretion. Elevated PDI1 copy number could increase the steady state 
expression level of PDI1 so increasing the amount of Pdilp activity. There are a 
number of alternative methods by which this could be achieved without increasing 
the copy number of the PDI1 gene, for example the steady state PDI1 mRNA 
level could be increased by either increasing the transcription rate, say by use of a 
higher efficiency promoter, or by reducing the clearance rate of the PDI1 mRNA. 
Alternatively, protein engineering could be used to enhance the specific activity or 
turnover number of the Pdilp protein. 

In high cell density fermentations control strain [pDB2711] recombinant 
transferrin (N413Q, N61 1Q) production was measured at approximately Sg.L" 1 by 
both GP-HPLC analysis and SDS-PAGE analysis (Table 2). This level of 
production is several fold-higher than control strain, the alternative control strain 
or Strain A containing pDB2536. Furthermore, for the production of proteins for 
therapeutic use in humans, expression systems such as control strain [pDB2711] 
have advantages over those using Strain A, as they do not contain bacterial DNA 
sequences. 

CONCLUSIONS 

Secretion of recombinant transferrin from a multicopy expression plasmid 
(pDB2536) was investigated in S. cerevisiae strains containing an additional copy 
of the PDI1 gene integrated into the yeast genome. Transferrin secretion was also 
investigated in S. cerevisiae transformed with a multicopy expression plasmid, in 
which the PDI1 gene has been inserted into the multicopy episomal transferrin 
expression plasmid (pDB271 1). 
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A S. cerevisiae strain with an additional copy of the PDI1 gene integrated into the 
genome at the endogenous PD11 locus, secreted recombinant transferrin and non- 
glycosylated recombinant transferrin (N413Q, N611Q) at an elevated level 

5 compared to strains containing a single copy of PDI1 . A further increase in PDI1 
copy number was achieved by using pDB271 1 In high cell density fermentation 
of the strain transformed with pDB2711, recombinant transferrin (N413Q, 
N611Q) was secreted at approximately 3g.L"\ as measured by SDS-PAGE. and 
GP-HPLC analysis. Therefore, increased PDI1 gene copy number has produced a 

10 large increase in the quantity of recombinant transferrins secreted from S. 
cerevisiae. 

The following conclusions are drawn - 

15 1. In shake flask analysis of recombinant transferrin expression from pDB2536 
(non-glycosylated transferrin (N413Q, N611Q) and pDB2506 (glycosylated 
transferrin) the S. cerevisiae strain Strain A secreted higher levels of both 
recombinant transferrins into the culture supernatant than control strains. This 
was attributed to the extra copy of PDI1 integrated at the PDI1 locus. 



20 



25 



2. Control strain [pDB2711], which contained, the PDI1 gene on the multicopy 
expression plasmid, produced a several-fold increase in recombinant transferrin 
(N413Q, N61 1Q) secretion compared to Strain A [pDB2536] in both shake flask 
culture and high cell density fermentation. 

3. Elevated PDI1 copy number in yeast such as S. cerevisiae will be 
advantageous during the production of desired proteins (such as a desired 
heterologous proteins), such as those from the transferrin family. 
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4. pSAC35-based plasmids containing additional copies of PDIJ gene 
advantages for the production of proteins from the transferrin family, and 
derivatives, such as fusions, mutants, domains and truncated forms. 



5 EXAMPLE 5 

Insertion of a PDI1 gene into a 2pm-like plasmid increased secretion of 
recombinant transferrin from various different S. cerevisiae strains 

10 The S. cerevisiae strain JRY1 88 cir + (National Collection of Yeast Cultures) and 
MT302/28B cir + (Finnis et al, 1993, Eur. J. Biochem., 212, 201-210) was cured of 
the native 2um plasmid by galactose induced over-expression of FLP from 
Yep351 -GAL-FLP1, as described by Rose and Broach (1990, Meth. EnzymoL, 
185. 234-279) to create the S. cerevisiae strains JRY188 cir° and MT302/28B cir°, 

15 respectively. 



25 



The S. cerevisiae strains JRY188 cir°, MT302/28B cir°, S150-2B cir° (Cashmore 
et al, 1986, MoL Gen. Genet., 203, 154-162), CB11-63 cir° (Zealey et al., 1988, 
Mol. Gen. Genet., 211, 155-159) were all transformed to leucine prototrophy with 
20 pDB2931 (Figure 14 of WO 2005/061718) and pDB2929 (Figure 12 of WO 
2005/061718). Transformants were selected on appropriately supplemented 
minimal media lacking leucine. Transformants of each strain were inoculated into 
lOmL YEPD in 50mL shake flasks and incubated in an orbital shaker at 30°C, 
200rpm for 4-days. Culture supematants were harvested and the recombinant 
transferrin titres compared by rocket Immunoelectrophoresis (Figure 10). The 
results indicated that the transferrin titres in supematants from all the yeast strains 
were higher when PDIJ was present in the 2um plasmid (pDB2929) than when it 
wasnot(pDB2931) 
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EXAMPLE 6 



The construction of expression vectors containing various PDI1 genes and the 
expression cassettes for various heterologous proteins on the same 2pm-like 
plasmid 

PCR amplification and cloning of PDI1 genes into Ylpladll: 

The PDI1 genes from S. cerevisiae S288c and S. cerevisiae SKQ2n were 
amplified by PCR to produce DNA fragments with different lengths of the 5'- 
untranslated region containing the promoter sequence. PCR primers were 
designed to permit cloning of the PCR products into the EcoBl and BamUl sites of 
YIplac211 (Gietz & Sugino, 1988, Gene, 74, 527-534). Additional restriction 
endonuclease sites were also incorporated into PCR primers to facilitate 
subsequent cloning. Table 3 describes the plasmids constructed and Table 4 gives 
the PCR primer sequences used to amplify the PDI1 genes. Differences in the 
PD11 promoter length within these YIplac2 1 1 -based plasmids are described in 
Table 3. 

pDB2939 (Figure 27 of WO 2005/061718) was produced by PCR amplification of 
the PDI1 gene from 5. cerevisiae S288c genomic DNA with oligonucleotide 
primers DS248 and DS250 (Table 5), followed by digesting the PCR product with 
£coRI and BamUl and cloning the approximately 1.98-kb fragment into 
YIplac211 (Gietz & Sugino, 1988, Gene, 74, 527-534), that had been cut with 
EcoKl and BamUl. DNA sequencing of pDB2939 identified a missing ; G' from 
within the DS248 sequence, which is marked in bold in Table 4. Oligonucleotide 
primers used for sequencing the PDI1 gene are listed in Table 5, and were 
designed from the published S288c PDI1 gene sequence (PD11/YCL043C on 
chromosome III from coordinates 50221 to 48653 plus 1000 base pairs of 
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upstream sequence and 1000 base pairs of downstream sequence. 
nm p://www. vftastpenome.org/ Genebank Accession number NC001 135). 

Table 3: Ylplac21 1 -based Plasmids Containing PDll Genes 



Plasmid 


Plasmid 
Base 


PDll Gene 


PCR Primers I 


Source 


Promoter 


Terminator 


pDB2939 


YIplac211 


S288c 


Long (~210-bp) 


Bsu36I 


DS248 +DS250 


pDB2941 


YIplac211 


S288c 


Medium (~140-bp) 


-> Bsu36I 


DS251 +DS250 


pDB2942 


YIplac2U 


S288c 


Short (~80-bp) 


-> Bsu36I 


DS252 + DS250 


pDB2943 


Ylplac211 


| SKQ2n 


Long (~210-bp) 


-> Bsu36I 


DS248 +DS250 


pDB2963 


YIplac211 


SKQ2n 


Medium (-140- bp) 


Bsu36I 


DS267 +DS250 


pDB2945 


Ylplac211 


SKQ2n 


Short (~80-bp) 


-> Bsu36I 


DS252 + DS250 
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Table 



4: Oligonucleotide Primers for PCR Amplification of S. cerevisiae PDIl 



Genes 



Primer 


Sequence 


DS248 


5 ' -GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCTAGT 
CTCTTTTTCCAATTTGCCACCGTGTAGCATTTTGTTGT-3' 


DS249 


5 < -GTCAGGATCCTACGTACCCGGGGATATCATTATCATCTTTGTCGTGGTCATCT 

TGTGTG-3' 


DS250 


5 ' -GTCAGGATCCTACGTACCCGGGTAAGGCGTTCGTGCAGTGTGACGAATAT 

AGCG-3' ! 


DS251 


5 ' -GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCCCGT 
ATGGACATACATATATATATATATATATATATATATTTTGTTACGCG- 3 ' 


DS252 


5 ' -GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCTTGTTG 
CAAGCAGCATGTCTAATTGGTAATTTTAAAGCTGCC-3 ' 


DS267 


5 • -GTCAGAATTCGAGCTCTACGTATTAATTAAGGCCGGCCAGGCCCGGGCCCGTA 
T GGAC AT AC AT AT AT AT AT AT AT AT AT AT AT AT AT AT AT T TT G TT AC GC G - 3 ' . 
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Table 5: Oligonucleotide Primers for DNA Sequencing S. cerevisiae PDI1 Genes 



Primer 


Sequence 


DS253 ! 


5' -CCTCCCTGCTGCTCGCC-3' 


DS254 


5' -CTGTAAGAACATGGCTCC-3' 


DS255 


5' -CTCGATCGATTACGAGGG-3' 


DS256 


5 ' -AAGAAAGCCGATATCGC- 3 ' j 


DS257 


5' -CAACTCTCTGAAGAGGCG-3 ' 


DS258 


5 ' -CAACGCCACATCCGACG- 3 ' 


DS259 


5 ' -GTAATTCTGATCACTTTGG- 3 ' 


DS260 


5' -GCACTTATTATTACTACGTGG- 3 ' 


DS261 


5' -GTTTTCCTTGATGAAGTCG-3 ' 


DS262 


5' -GTGACCACACCATGGGGC-3 ' 


DS263 


c,' -r;TTGCCGGCGTGTCTGCC-3 ' 


DS264 


5' -TTGAAATCATCGTCTGCG-3' 


DS265 - 


5' -CGGCAGTTCTAGGTCCC-3 ' 


DS266 


5' -CCACAGCCTCTTGTTGGG-3 ' 


M13/pUC Primer (-40) 


5' -GTTTTCCCAGTCACGAC-3 ' 



Plasmids pDB2941 (Figure 28 of WO 2005/061718) and pDB2942 (Figure 29 of 
WO 2005/061718) were constructed similarly using the PCR primers described in 
Tables 3 and 4, and by cloning the approximately 1.90-kb and 1.85-kb EcoRl- 
BamHL fragments, respectively, into YIplac211. The correct DNA sequences 
were confirmed for the PDI1 genes in pDB2941 and pDB2942. 
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The S. cerevisiae SKQ2n PDI1 gene sequence was PCR amplified from plasmid 
DNA contaimng the PDI1 gene from pMA3a:C7 (US 6.291,205), also known as 
Clone C7 (Crouzet & Tuite, 1987, supra; Farquhar el al, 1991, supra). The 
SKQ2n PDI1 gene was amplified using oligonucleotide primers DS248 and 
DS250 (Tables 3 and 4). The approximately 2.01-kb PCR product was digested 
with EcoKl and BamHl and ligated into YIplac21 1 (Gietz & Sugino, 1988, Gene, 
74, 527-534) that has been cut with EcoKL and BamHl, to produce plasmid 
P DB2943 (Figure 30 of WO 2005/061718). The 5' end of the SKQ2n PDI1 
sequence is analogous to a blunt-ended Spel-site extended to include the EcoRl, 
Sad, Snam, Pad, Fsel, Sfil and Smal sites, the 3' end extends up. to a site 
analogous to a blunt-ended Bsu36l site, extended to include a Smal, SnaSl and 
BamHl sites. The PDI1 promoter length is approximately 210bp. The entire 
DNA sequence was determined for the PDI1 fragment using oligonucleotide 
primers given in Table 5. This confirmed the presence of a coding sequence for 
the PDI protein of S. cerevisiae strain SKQ2n (NCBI accession number 
CAA38402), but with a serine residue at position 114 (not an arginine residue as 
previously published). Similarly, in the same way as in the S. cerevisiae S288c 
sequence in pDB2939, pDB2943 also had a missing e G' from within the DS248 
sequence, which is marked in bold in Table 4. 

Plasmids pDB2963 (Figure 31 of WO 2005/061718) and pDB2945 (Figure 32 of 
WO 2005/061 718) were constructed similarly using the PCR primers described in 
Tables 3 and 4, and by cloning the approximately 1.94-kb and 1.87-kb EcoKl- 
BamHl fragments, respectively, into YIplac211. The expected DNA sequences 
were confirmed for the PDI1 genes in pDB2963 and pDB2945, with a serine 
codon at the position of amino acid 1 1 4. 
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The construction of pSAC35-based rHA expression plasmids with different 
PDI1 genes inserted at the A'cml-site after REP2: 

pSAC35-based plasmids were constructed for the co-expression of rHA with 
different PDI1 genes (Table 6). 



Table 6: pSAC35-based plasmids for co-expression of rHA with different PDI1 
genes 



riasmia 


Base 


PD11 Gene at Xcml-site after REP 2 


Heterologous Protein 
Expression Cassette 
(at Notl-site) 


Source 


Promoter 


Terminator 


Orientation 


pDB2982 


pSAC35 1 


SKQ2n 


Long 


Bsu5<5/ 


A 


rHA 


pDB2983 


pSAC35 


SKQ2n 


Long 


-> Bsu36I 


■ 


rHA 


pDB2984 


pSAC35 


SKQ2n 


Medium 


-> Bsuitf/ 




rHA 


pDB2985 


pSAC35 


SKQ2n 


Medium 


~> Bsuitf/ 


B 


rHA 


pDB2986 


pSAC35 


SKQ2n 


Short 


-> Bsu36I 


A 


rHA 


pDB2987 


pSAC35 


SKQ2n 


Short 


-> Bsu36I 


£ 


rHA 


pDB2976 


pSAC35 


S288c 


Long 


Bsu36J 


A 


rHA 


pDB2977 


pSAC35 


S288c 


Long 


-> Bsu36I 


B 


rHA 


pDB2978 


pSAC35 


S288c 


Medium 


-> Bsu36J 


A 


rHA 


pDB2979 


pSAC35 


S288c 


Medium 


-> Bsu36I 


B 


rHA 


pDB2980 


pSAC35 


S288c 


Short 


-> Bsu36I 


A 


rHA 


pDB2981 


pSAC35 


S288c 


Short 


-> Bsx\36I 


B 


rHA 



.The rHA expression cassette from pDB2243 (Figure 33 of WO 2005/061718, as 
also described in WO 00/44772) was first isolated on a 2,992-bp Notl fragment, 
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which subsequently was cloned into the Mtfl-site of pDB2688 (Figure 4 of WO 
2005/061718) to produce P DB2693 (Figure 34 of WO 2005/061718). pDB2693 
was digested with SnaBl, treated with calf intestinal alkaline phosphatase, and 
ligated & with SnaBl fragments containing the PDU genes from pDB2943, 
PDB2963, P DB2945, pDB2939, pDB2941 and P DB2942. This produced 
plasmids pDB2976 to P DB2987 (Figures 35 to 46 of WO 2005/061718). PDU 
transcribed in the same orientation as REP2 was designated "orientation A", 
whereas PDI1 transcribed in opposite orientation to REP2 was designated 
"orientation B" (Table 6). 

The construction of pSAC35-bascd transferrin expression plasmids with 
different PDI1 genes inserted at the Xcml-site after REP2: 

pSAC35-based plasmids were constructed for the co-expression of recombinant 
transferrin (N413Q.N611Q) with different PDU genes (Table 7). 

Table 7: pSAC35-based plasmids for co-expression of transferrin with different 
PDU genes 



Plasmid 


Plasmid 


PD11 Gene at Xcml-site after REP2 




Heterologous Protein 




Base . 


Source 


Promoter 


Terminator 


Orientation 


Expression Cassette 
(at Notl-site) 


pDB2929 


pSAC35 


SKQ2n 


Long 


-> Bsu36J 


A 


rTf(N413& N611Q) 


pDB3085 


pSAC35 


S288c 


Long 


Bsu3<f/ 


A 


rTf(N413Q, N6J1Q) 


pDB3086 


pSAC35 


S288c 


Medium 


—> Bsu36J 


A 


1 rTf(N413Q,N611Q) i 


pDB3087 


pSAC35 


S288c 


Short 


-+Bsu36/ 


A 


rTf(N413Q, N611Q) 



In order to achieve this, the NotI expression cassettes for rHA expression were 
first deleted from pDB2976, pDB2978, and pDB2980 by NotI digestion and 
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circularisation of the vector backbone. This produced plasmids pDB3081 (Figure 
47 of WO 2005/061718), pDB3083 (Figure 48 of WO 2005/061718) and 
pDB3084 (Figure 49 of WO 2005/061 71 8) as described in Table 8. 



5 Table 8: pSAC35-based plasmids with different PDI1 genes 



Plasmid 


Plasmid 
Base 


PDI1 Gene at Xcml-site after REP2 


Heterologous Protein 
Expression Cassette 
(at Notl-site) 


Source 


Promoter 


Terminator 


Orientation 


pDB2690 


pSAC35 


SKQ2n 


Long 


Bsuitf/ 


A 


None 


pDB3081 


P SAC35 


S288c 


Long 


Bsu36I 


A 


None 


pDB3083 


pSAC35 


S288c 


Medium 


Bsu36I 


A 


None 


pDB3084 


pSAC35 


S288c 


Short 


-» Bsu567 


A 


None 



The 3,256-bp NotI fragment from pDB2928 (Figure 1 1 of WO 2005/061718) was 
cloned into the Notl-sites of pDB3081, pDB3083 and pDB3084, such that 
10 transcription from the transferrin gene was in the same direction as LEU 2. This 
produced plasmids pDB3085 (Figure 50 of WO 2005/061718), pDB3086 (Figure 
51 of WO 2005/061718) and pDB3087 (Figure 52 of WO 2005/061718) as 
described in Table 7. 

15 EXAMPLE 7 

Insertion and optimisation of a PDI1 gene in the 2jjm-like plasmid increased 
the secretion of recombinant human serum albumin by various different S. 
cerevisiae strains 

20 

The S. cerevisiae strains JRY188 cir°, MT302/28B cir°, S150-2B cir° , CB 11-63 
ci"r° (all described above), AH22 cir° (Mead et al, 1986, Mol. Gen. Genet., 205, 
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10 



15 



20 



417-*>1) and DS569 cir° (Sleep et al, 1991, Bio/Technology, 9, 183-187) were 
transformed to leucine prototrophy with either P DB2244 (WO 00/44772), 
P DB2976 (Fieure 35 of WO 2005/061718), P DB2978 (Figure 37 of WO 
2005/061718) or P DB2980 (Figure 39 of WO 2005/061718) using a modified 
lithium acetate method (Sigma yeast transformation kit, YEAST- 1 , protocol 2: 
(Ito et al, 1983, J. Bacterial, 153, 163: Elble, 1992, Biotechniques, 13, 18)). 
Transformants were selected on BMMD-agar plates with appropriate supplements, 
and were subsequently patched out on BMMD-agar plates with appropriate 
supplements. 

Transformants of each strain were inoculated into lOmL YEPD in 50mL shake 
flasks and incubated in an orbital shaker at 30°C, 200rpm for 4-days. Culture 
supernatants were harvested and the recombinant albumin titres compared by 
rocket Immunoelectrophoresis (Figures 11 and 12). The results indicated that the 
albumin titres in the culture supernatants from all the yeast strains were higher 
when PDI1 was present in the 2um plasmid than when it was not (pDB2244). 
The albumin titre in the culture supernatants in the absence of PDI1 on the 
plasmid was dependant upon which yeast strain was selected as the expression 
host, however, in most examples tested the largest increase in expression was 
obse'rved when PDI1 with the long promoter (~210-bp) was present in the 2um 
plasmid (pDB2976). Modifying the PD11 promoter by shortening, for example to 
delete reflation sites, had the affect of controlling the improvement. For one 
yeast strain, known to be a high rHA producing strain (DS569) a shorter promoter 
was preferred for optimal expression. 
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EXAMPLE 8 

PDI1 on the 2/Mn-based plasmid enhanced the secretion of recombinant 
albumin fusions. 

The affect of co-expression of the 5. cerevisiae SKQ2n PDI1 gene with the long 
promoter (~210-bp) upon the expression of recombinant albumin fusions was 
investigated. 

The plasmids defined in Table 9 below were generated as described in Example 9 
of WO 2005/061718, the contents of which are incorporated herein by reference. 



Table 9: Summary of plasmids encoding albumin fusion proteins 



Albumin fusion 


Plasmid name 




With PDU 


Without PDU 


endostatin-albumin 


pDB3100 


PDB3099 


angiostatin-albumin 


pDB3107 


pDB2765 


Kringle5-(GGS) 4 GG-albumin 


pDB3104 


pDB2773 


DX-890-(GGS) 4 GG-albumin 


pDB3102 


pDB3101 


DPI-14-(GGS) 4 GG-albumin 


pDB3103 


pDB2679 


Axokine™-(GGS) 4 GG-albumin 


pDB3106 


pDB2618 


human IL10-(GGS)4-GG-albumin 


pDB3105 


pDB2621 



The same control yeast strain as used in previous examples was transformed to 
leucine prototrophy using a modified lithium acetate method (Sigma yeast 
transformation kit, YEAST- 1, protocol 2; (Ito et al, 1983, J. Bacteriol, 153, 163; 
Elble, 1992, Biotechniques, 13, 18)). Transformants were selected on BMMD- 
agar plates, and were subsequently patched out on BMMD-agar plates. 
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Cryopreserved trehalose stocks were prepared from lOmL BMMD shake flask 
cultures (24 hrs, 30°C, 200rpm). 

Transformants of each strain were inoculated into lOmL BMMD in 50mL shake 
5 flasks and incubated in an orbital shaker at 30°C, 200rpm for 4-days. Culture 
supernatants were harvested and the recombinant albumin fusion titres compared 
by rocket immunoelectrophoresis (Figure 13). The results indicated that the 
albumin fusion titre in the culture supernatants from yeast strain was higher when 
PD11 was present in the 2 urn plasmid than when it was not. 

10 

The increase in expression of the albumin fusions detected by rocket 
immunoelectrophoresis was further studied by SDS-PAGE analysis. BMMD 
shake flask cultures of YBX7 expressing various albumin-fusions were grown for 
4-days in an orbital shaker at 30°C, 200rpm. A sample of the culture supernatant 
15 was analysed by SDS-PAGE (Figure 14). A protein band of the expected size for 
the albumin fusion under study was observed increase in abundance. 

EXAMPLE 9 

20 Co-expression of S. cerevisiae ORM2 and recombinant transferrin on a 2fjm- 
based plasmid 

The S. cerevisiae Control Strain and Strain A (as described in Example 3) were 
selected to investigate the effect on transferrin secretion when the transferrin and 

25 ORM2 genes were co-expressed from the 2|am-based plasmids. The Control 
Strain and Strain A were transformed to leucine prototrophy by plasmids 
pDB3090, pDB3092 and pBD3093 (containing expression cassettes for rTf 
(N413Q, N611Q) and for ORM2), as well as a control plasmid pDB2931 
(containing the rTf (N413Q, N611Q) expression cassette without ORM2). The 

30 construction of these plasmids is described in Example 10 of WO 2005/061718, 
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the contents of which are incorporated herein by reference. Transformants were 
selected on BMMD agar and patched out on BMMD agar for subsequent analysis. 

To investigate the effect of ORM2 co-expression on transferrin secretion, 1 OmL 
5 selective (BMMD) and non-selective (YEPD) liquid media were inoculated with 
strains containing the ORM2/transferrin co-expression plasmids. The shake flask 
culture was then incubated at 30°C with shaking (200 rpm) for 4 days. The 
relative level of transferrin secretion was determined by rocket gel 
Immunoelectrophoresis (RIE) (Figure 15). 

10 

Levels of transferrin secreted from Control Strain [pDB3090] and Control Strain 
[pDB3092] were greater than the levels from Control Strain [pDB2931] in both 
BMMD and YEPS media. Similarly, the levels of transferrin secreted from both 
Strain A [pDB3090] and Strain A [pDB3093] were greater than the levels from 

15 Strain A [pDB293 1] in both BMMD and YEPS media. Transferrin secretion from 
all Strain A transformants was higher than the Control Strain transformants grown 
in the same media. Strain A contains an additional copy of PDI1 in the genome, 
which enhanced transferrin secretion. Therefore in Strain A, the increased 
expression of ORM2 and PDI1 had a cumulative effect on the secretion of 

20 transferrin. 

EXAMPLE 10 

Co-expression of S. cerevisiae PSE1 and recombinant transferrin on a 2jjm- 
25 based plasmid 

The S. cerevisiae Control Strain was transformed to leucine prototrophy by 
plasmids, pDB3097 and pBD3098 (containing expression cassettes for rTf 
(N413Q, N611Q) and for PSE1), as well as a control plasmid pDB2931 
30 (containing the rTf (N413Q, N611Q) expression cassette without PSE1). The 
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construction of these plasmids are described in Example 11 of WO 2005/061718, 
the contents of which are incorporated herein by reference. Transformants were 
selected on BMMD agar and patched out on BMMD agar for subsequent analysis. 

5 To investigate the effect of PSE1 expression on transferrin secretion, flasks 
containing lOmL selective (BMMD) liquid media were inoculated with strains 
containing the PSE1 /transferrin co-expression plasmids. The shake flask culture 
was then incubated at 30°C with shaking (200 rpm) for 4 days. The relative level 
of transferrin secretion was determined by rocket gel Immunoelectrophoresis 
10 (RIE) (Figure 16). 

Levels of transferrin secreted from Control Strain [pDB3097] and Control Strain 
[pDB3098] were greater than the levels from Control Strain [pDB2931] in 
BMMD media. Therefore, expression of PSE1 from the 2|im-based plasmids had 
15 enhanced transferrin secretion from S. cerevisiae. Transferrin secretion was 
improved with the PSE1 gene transcribed in either direction relative to the REP2 
gene in pDB3097 and pDB3098. 

EXAMPLE 11 

20 

Co-expression of S. cerevisiae SSA1 and recombinant transferrin on a 2jjm- 
based plasmid 

The S. cerevisiae Control Strain was transformed to leucine prototrophy by 
25 plasmids, pDB3094 and pBD3095 (containing expression cassettes for rTf 
(N413Q, N611Q) and for SSAI), as well as a control plasmid pDB2931 
(containing the rTf (N413Q, N611Q) expression cassette without SSA1). The 
construction of these plasmids are described in Example 12 of WO 2005/061718, 
the contents of which are incorporated herein by reference. Transformants were 
30 selected on BMMD agar and patched out on BMMD agar for subsequent analysis. 
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10 



15 



20 



To investigate the effect of SSA1 expression on transferrin secretion, flasks 
containing lOmL selective (BMMD) liquid media were inoculated with strains 
containing the SSA1 /transferrin co-expression plasmids. The shake flask cultures 
were incubated at 30°C with shaking (200 rpm) for 4 days. The relative level of 
transferrin secretion was determined by rocket gel Immunoelectrophoresis (RIE) 
(Figure 17). 

Levels of transferrin secreted from Control Strain [pDB3095] were greater than 
the levels from Control Strain [pDB2931] and Control Strain [pDB3094] in 
BMMD media. Therefore, expression of SSA1 from the 2um-based plasmids had 
enhanced transferrin secretion from 5. cerevisiae. Transferrin secretion was 
improved with the SSA1 gene transcribed in the opposite direction relative to the 
REP 2 gene in pDB3094. 

EXAMPLE 12 

PDU gene disruption, combined with a PDI1 gene on the 2ym-based plasmid 
enhanced the secretion of recombinant albumin and plasmid stability. 

Single stranded oligonucleotide DNA primers listed in Table 1 1 were designed to 
amplify a region upstream of the yeast PD11 coding region and another a region 
downstream of the yeast PD11 coding region. 
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Primer 


Description 


Sequence 


DS299 


5' PDI1 
primer, 38mer 


5'- CGTAGCGGCCGCCTGAAAGGGGTTG ACCGTCCGT 
CGGC-3 1 


DS300 


5' PDI1 
primer, 40mer 


S'-CGTAAAGCTTCGCCGCCCGACAGGGTAACATATTAT 
CAC-3' 


DS301 


3' PDI1 
primer, 38mer 


- , f~~\s~^ np a a a pttp a r^r^ a. r^r^T a r,TA ata ata a r^TdC^ A T 

5 -CGTAAAGCTI uACtALulAU I aaiaaiaau i ulai i 

GGC-3' 


DS302 


3' PDI1 
primer, 41mer 


5'-CGTACTGCAGATTGGATAGTGATTAGAGTGTATAGTCC 
CGG-3' 


DS303 


18mer 


5'-GGAGCGACAAACCTTTCG-3' 


DS304 


20mer 


5 '-ACCGTAATAAAAGATGGCTG-3 ' 


DS305 


24mer 


5' -CATCTTGTGTGTGAGTATGGTCGG-3' 


DS306 


14mer 


5 5 -CCC AGG ATAATTTTCAGG-3 ' 



Primers DS299 and DS300 amplified the 5' region of PDI1 by PCR, while primers 
5 DS301 and DS302 amplified a region 3' of PDI1, using genomic DNA derived 
S288c as a template. The PCR conditions were as follows: luL S288c template 
. DNA (at O.Olng/uL, O.lng/uL, Ing/uL, lOng/uL and lOOng/uL), 5uL lOXBuffer 
(Fast Start Taq+Mg, (Roche)), luL lOmM dNTP's, 5uL each primer (2uM), 
0.4uL Fast Start Taq, made up to 50uL with H 2 0. PCRs were performed using a. 
10 Perkin-Elmer Thermal Cycler 9700. The conditions were: denature at 95°C for 
4min [HOLD], then [CYCLE] denature at 95°C for 30 seconds, anneal at 45°C for 
30 seconds, extend at 72°C for 45 seconds for 20 cycles, then [HOLD] 72°C for 
lOmin and then [HOLD] 4°C. The 0.22kbp PDI1 5' PCR product was cut with 
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3 



10 



15 



20 



Notl and Hindm, while the 0.34kbp PDI1 3' PCR product was cut with Hindm 
and Pstl. 

Plasmid pMCS5 (Hoheisel, 1994, Biotechniques 17, 456-460) (Figure 85 of WO 
2005/06171 8) was digested to completion with Hindm, blunt ended with T4 DNA 
polymerase plus dNTPs and religated to create pDB2964 (Figure 86 of WO 
2005/061718). 

Plasmid pDB2964 was Hindm digested, treated with calf intestinal phosphatase, 
and ligated with the 0.22kbp PDI1 5' PCR product digested with Notl and Hindm 
and the 0.34kbp PDI1 3' PCR product digested with Hindm and Pstl to create 
pDB3069 (Figure 87 of WO 2005/061718) which was sequenced with forward 
and reverse universal primers and the DNA sequencing primers DS303, DS304, 
DS305 and DS306 (Table 1 1). 



Primers DS234 and DS235 (Table 12) were used to amplify the modified TRP1 
marker gene from Ylplac204 (Gietz & Sugino, 1988, Gene, 74, 527-534), 
incorporating Hindm restriction sites at either end of the PCR product. The PCR 
conditions were as follows: luL template YIplac204 (at O.Olng/uL, O.lng/uL, 
lng/u-L, lOng/uL and lOOng/uL), 5uL lOXBuffer (Fast Start Taq+Mg, (Roche)), 
1HL lOmM dNTP's, 5uL each primer (2uM), 0.4uL Fast Start Taq, made up to 
50uL with H 2 0. PCRs were performed using a Perkin-Elmer Thermal Cycler 
9600. The conditions were: denature at 95°C for 4min [HOLD], then [CYCLE] 
denature at 95°C for 30 seconds, anneal for 45 seconds at 45°C, extend at 72°C 
for 90sec for 20 cycles, then [HOLD] 72°C for lOmin and then [HOLD] 4°C. The 
0.86kbp PCR product was digested with Hindm and cloned into the Hindm site 
of pMCS5 to create pDB2778 (Figure 88 of WO 2005/061718). Restriction 
enzyme digestions and sequencing with universal forward and reverse primers as 
well as DS236, DS237, DS238 and DS239 (Table 12) confirmed that the sequence 
30 of the modified TRP1 gene was correct. 
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Table 12: Oligonucleotide primers 



Primer 


Description 


Sequence 


DS230 


TRP 1 5' UTR 


5'-TAGCGAATTC AATCAGTAAAAATCAACGG-3 ' 


DS231 


TRP1 5' UTR 


5 '-GTC AAAGCTTCAAAAAAAG A AAAGCTCCGG-3 1 


DS232 


TRP 1 3' UTR 


5'-TAGCGGATCCGAATTCGGCGGTTGTTTGCAAGACC 

GAG-3' 


DS233 


TRP J 3' UTR 


5 , -GTCAAAGCTTTAAAGATAATGCTAAATCATTTGG-3' 


DS234 


TRP1 


5 '-TGACAAGCTTTCGGTCG AAAAAAGAAAAGG AG 
AGG-3' 


DS235 


TRP1 


5'-TGACAAGCTTGATCTTTTATGCTTGCTTTTC-3' 




T7?V> 7 
1 Kl I 


5'-AATAGTTCAGGCACTCCG-3' 


DS237 


TRP1 


S'-TGGAAGGCAAGAGAGCC^ 5 


DS238 


TRP1 


5'-TAAAATGTAAGCTCTCGG-3 1 


DS239 


TRP1 


5 '-CC AACCAAGTATTTCGG-3 ' 


CED005 


ATRP1 


5 '-G AGCTG ACAGGG AAATGGTC-3 ' 


CED006 


ATRP1 


5'-TACGAGGATACGGAG AG AGG-3' j 



The 0.86kbp TRP1 gene was isolated from pDB2778 by digestion with Hindlll 
and cloned into the Hindlll site of pDB3069 to create pDB3078 (Figure 89 of WO 
2005/061718) and pDB3079 (Figure 90 of WO 2005/061718). A 1.41kb 
pdil::TRPl disrupting DNA fragment was isolated from pDB3078 or pDB3079 
by digestion with NotVPstl. 
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Yeast strains incorporating a TRP1 deletion (trplA) were to be constructed in such 
a way that no homology to the TRP1 marker gene (pDB2778) should left in the 
genome once the trplA had been created, so preventing homologous 
recombination between future TRP1 containing constructs and the TRP1 locus. In 

5 order to achieve the total removal of the native TRP1 sequence from the genome 
of the chosen host strains, oligonucleotides were designed to amplify areas of the 
5' UTR and 3' UTR of the TRP1 gene outside of TRP1 marker gene present on 
integrating vector YIplac204 (Gietz & Sugino, 1988, Gene, 74, 527-534). The 
YIplac204 TRP1 marker gene differs from the native/chromosomal TRP1 gene in 

10 that internal i/mdIII, Pstl and Xbal sites were removed by site directed 
mutagenesis (Gietz & Sugino, 1988, Gene, 74, 527-534). The YIplac204 
modified TRP1 marker gene was constructed from a 1.453kbp blunt-ended 
genomic fragment EcoRl fragment, which contained the TRP1 gene and only 
102bp of the TRP1 promoter (Gietz & Sugino, 1988, Gene, 74, 527-534). 

] 5 Although this was a relatively short promoter sequence it was clearly sufficient to 
complement trpl auxotrophic mutations (Gietz & Sugino, 1988, Gene, 74, 527- 
534). Only DNA sequences upstream of the EcoRl site, positioned 102bp 5' to 
the start of the TRP1 ORF were used to create the 5 5 TRP1 UTR. The selection of 
the V UTR was less critical as long as it was outside the 3' end of the functional 

20 modified TRP1 marker, which was chosen to be 85bp downstream of the 
translation stop codon. 

Single stranded oligonucleotide DNA primers were designed and constructed to 
amplify the 5' UTR and 3' UTR regions of the TRP1 gene so that during the PCR 

25 amplification restriction enzyme sites would be added to the ends of the PCR 
products to be used in later cloning steps. Primers DS230 and DS231 (Table 12) 
amplified the 5' region of TRP1 by PCR, while primers DS232 and DS233 (Table 
12) amplified a region 3' of TRP1, using S288c genomic DNA as a template. The 
PCR conditions were as follows: \\\L template S288c genomic DNA (at 

30 O.Olng/^iL, O.lng/^iL, lng/^L, lOng/^L and lOOng/^iL), 5|aL lOXBuffer (Fast 
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Start Taq+Mg, (Roche)), I\xL lOmM dNTP's, 5yL each primer (2fiM), 0.4|aL Fast 
Start Taq, made up to 50jaL with H 2 0. PCRs were performed using a Perkin- 
Elmer Thermal Cycler 9600. The conditions were: denature at 95°C for 4min 
[HOLD], then [CYCLE] denature at 95°C for 30 seconds, anneal for 45 seconds at 
5 45°C, extend at 72°C for 90sec for 20 cycles, then [HOLD] 72°C for lOmin and 
then [HOLD] 4°C. 

The 0.19kbp TRP1 5' UTR PCR product was cut with EcoJU and HindUI, while 
the 0.2kbp TRP1 3' UTR PCR product was cut with BamYLI and HindUI and 

10 ligated into pAYE505 linearised with BawHI/EcoRl to create plasmid pDB2777 
(Figure 91 of WO 2005/061718). The construction of pAYE505 is described in 
WO 95/33833 . DNA sequencing using forward and reverse primers, designed to 
prime from the plasmid backbone and sequence the cloned inserts, confirmed that 
in both cases the cloned 5' and 3' UTR sequences of the TRP1 gene had the 

15 expected DNA sequence. Plasmid pDB2777 contained a TRP1 disrupting 
fragment that comprised a fusion of sequences derived from the 5' and 3' UTRs of 
TRPL This 0.383kbp TRP1 disrupting fragment was excised from pDB2777 by 
complete digestion with EcoRI. 

20 Yeast strain DXY1 (Kerry- Williams et a/., 1998, Yeast, 14, 161-169) was 
transformed to leucine prototrophy with the albumin expression plasmid pDB2244 
using a modified lithium acetate method (Sigma yeast transformation kit, YEAST- 
1, protocol 2; (Ito et al, 1983, J. BacterioL, 153, 163; Elble, 1992, Biotechniques, 
13, 18)) to create yeast strain DXY1 [pDB2244]. The construction of the albumin 

25 expression plasmid pDB2244 is described in WO 00/44772. Transformants were 
selected on BMMD-agar plates, and were subsequently patched out on BMMD- 
agar plates. Cryopreserved trehalose stocks were prepared from lOmL BMMD 
shake flask cultures (24 hrs, 30°C, 200rpm). 
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10 



DXY1 [pDB2244] was transformed to tryptophan autotrophy with the 0.383kbp 
EcoW TRP1 disrupting DNA fragment from pDB2777 using a nutrient agar 
incorporating the counter selective tryptophan analogue, 5-fluoroanthranilic acid 
(5-FAA), as described by Toyn el al, (2000 Yeast 16, 553-560). Colonies 
resistant to the toxic effects of 5-FAA were picked and streaked onto a second 
round of 5-FAA plates to confirm that they really were resistant to 5-FAA and to 
select away from any background growth. Those colonies which grew were then 
were re-patched onto BMMD and BMMD plus tryptophan to identify which were 
tryptophan auxotrophs. 

Subsequently colonies that had been shown to be tryptophan auxotrophs were 
selected for further analysis by transformation with YCplac22 (Gietz & Sugino, 
. 1988, Gene, 74, 527-534) to ascertain which isolates were trpl. 

15 PCR amplification across the TRP1 locus was used to confirm that the trp" 
phenotype was due to a deletion in this region. Genomic DNA was prepared from 
isolates identified as resistant to 5-FAA and unable to grow on minimal media 
without the addition of tryptophan. PCR amplification of the genomic TRP1 locus 
with primers CED005 and CED006 (Table 12) was achieved as follows: luL 

20 template genomic DNA, 5uL lOXBuffer (Fast Start Taq+Mg, (Roche)), luL 
lOmM dNTP's, 5uL each primer (2uM), 0.4uL Fast Start Taq, made up to 50uL 
with H 2 0. PCRs were performed using a Perkin-Elmer Thermal Cycler 9600. 
The conditions were: denature at 94°C for lOmin [HOLD], then [CYCLE] 
denature at 94°C for 30 seconds, anneal for 30 seconds at 55°C, extend at 72°C 

25 for 120sec for 40 cycles, then [HOLD] 72°C for lOmin and then [HOLD] 4°C. 
PCR amplification of the wild type TRP1 locus resulted in a PCR product of 
1.34kbp in size, whereas amplification across the deleted TRP1 region resulted in 
a PCR product 0.84kbp smaller at 0.50kbp. PCR analysis identified a DXY1 
derived trp". strain (DXY1 trpl A [pDB2244]) as having the expected deletion 

30 event. 
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The yeast strain DXY1 trplA [pDB2244] was cured of the expression plasmid 
pDB2244 as described by Sleep et al, (1991, Bio/Technology, 9, 183-187). 
DXY1 trplA cir° was re-transformed the leucine prototrophy with either 
5 pDB2244, pDB2976, pDB2977, pDB2978 5 pDB2979, pDB2980 or pDB2981 
using a modified lithium acetate method (Sigma yeast transformation kit, YEAST- 
1 3 protocol 2; (Ito et al, 1983, J. Bacterids 153, 163; Elble, 1992 5 Biotechniques, 
13, 18)). Transformants were selected on BMMD-agar plates supplemented with 
tryptophan, and were subsequently patched out on BMMD-agar plates 
10 supplemented with tryptophan. Cryopreserved trehalose stocks were prepared 
from lOmL BMMD shake flask cultures supplemented with tryptophan (24 hrs 5 
30°C ? 200rpm). 

The yeast strains DXY1 trplA [pDB2976] 5 DXY1 trplA [pDB2977], DXY1 
15 trplA [pDB2978], DXY1 trplA [pDB2979], DXY1 trplA [pDB2980] or DXY1 
trplA [pDB2981] was transformed to tryptophan prototrophy using the modified 
lithium acetate method (Sigma yeast transformation kit, YEAST- 1, protocol 2; 
QloetaU 1983, J. BacterioL, 153, 163; Elble, 1992, Biotechniques, 13, 18)) with a 
1.41kb pdil::TRPl disrupting DNA fragment was isolated from pDB3078 by 
20 digestion with NotllPstl. Transformants were selected on BMMD-agar plates and 
were subsequently patched out on BMMD-agar plates. 

Six transformants of each strain were inoculated into 1 OmL YEPD in 50mL shake 
flasks and incubated in an orbital shaker at 30°C, 200rpm for 4-days. Culture 

25 supernatants and cell biomass were harvested. Genomic DNA was prepared (Lee, 
1992, Biotechniques, 12, 677) from the tryptophan prototrophs and DXY1 
[pDB2244]. The genomic PDI1 locus amplified by PCR of with primers DS236 
and DS303 (Table 11 and 12) was achieved as follows: l^iL template genomic 
DNA, 5^iL lOXBuffer (Fast Start Taq+Mg, (Roche)), ljiL lOmM dNTP's, 5|_tL 

30 each primer (2|iM), 0.4|aL Fast Start Taq, made up to 50^iL with H 2 0. PCRs were 
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performed using a Perkin-Elmer Thermal Cycler 9700. The conditions were: 
denature at 94°C for 4min [HOLD], then [CYCLE] denature at 94°C for 30 
seconds, anneal for 30 seconds at 50°C, extend at 72°C for 60sec for 30 cycles, 
then [HOLD] 72°C for lOmin and then [HOLD] 4°C. PCR amplification of the 
5 wild type PDI1 locus resulted in no PCR product, whereas amplification across 
the deleted PDI1 region resulted in a PCR product 0.65kbp. PCR analysis 
identified that all 36 potential pdil::TRPl strains tested had the expected 
pdil::TRPl deletion. 

10 The recombinant albumin titres were compared by rocket immunoelectrophoresis 
(Figure 18). Within each group, all six pdil::TRPl disruptants of DXY1 trplA 
[pDB2976], DXY1 trplA [pDB2978], DXY1 trplA [pDB2980], DXY1 trplA 
[pDB2977] and DXY1 trplA [pDB2979] had very similar rHA productivities. 
Only the six pdil::TRPl disruptants of DXY1 trplA [pDB2981] showed variation 

15 in rHA expression titre. The six pdil::TRPl disruptants indicated in Figure 18 
were spread onto YEPD agar to isolate single colonies and then re-patched onto 
BMMD agar. 

Three single celled isolates of DXY1 trplA pdil::TRPl [pDB2976], DXY1 trplA 
20 pdil::TRPl [pDB2978], DXY1 trplA pdil::TRPl [pDB2980], DXY1 trplA 
pdil::TRPl [pDB2977], DXY1 trplA pdil::TRPl [pDB2979] and DXY1 trplA 
pdil::TRPl [pDB2981] along with DXY1 [pDB2244], DXY1 [pDB2976], DXY1 
[pDB2978], DXY1 [pDB2980], DXY1 [pDB2977], DXY1 [pDB2979] and DXY1 
[pDB2981] were inoculated into lOmL YEPD in 50mL shake flasks and incubated 
25 in an orbital shaker at 30°C, 200rpm for 4-days. Culture supernatants were 
harvested and the recombinant albumin titres were compared by rocket 
immunoelectrophoresis (Figure 19). The thirteen wild type PDI1 and pdil::TRPl 
disruptants indicated in Figure 19 were spread onto YEPD agar to isolate single 
colonies. One hundred . single celled colonies from each strain were then re- 
30 patched onto BMMD agar or YEPD agar containing a goat anti-HS A antibody to 
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detect expression of recombinant albumin (Sleep et ah, 1991, Bio/Technology, 9, 
183-187) and the Leu+/rHA+, Leu+/rHA-, Leu-/rHA+ or LeiWrHA- phenotype of 
each colony scored (Table 13). 

5 Table 13: 





PDI1 


pdil::TRPl 1 


X-,eu. ' 
rHA+ 


T 

rHA+ 


rHA- 


Leu- 
rHA- 


Leu+ 
rHA+ 


Leu- 
rHA+ 


Leu+ 
rHA- 


Leu- 
rHA- 


pDB2244 


100 


0 


0 


0 










pDB2976 


7 


0 


47 


46 


97 


0 


3 


0 


pDB2978 


86 


0 


0 


14 


100 


0 


0 


0 


pDB2980 


98 


0 


0 


2 


100 


0 


0 


0 


pDB2977 


0 


0 


4 


96 


100 


0 


0 


0 


pDB2979 


69 


0 


6 


25 


100 


0 


0 


0 


pDB2981 


85 


0 


0 


15 


92 


0 


0 


8 



These data indicate plasmid retention is increased when the PDU gene is used as a 
selectable marker on a plasmid in a host strain having no chromosomally encoded 
0 PDI, even in a non-selective medium such as the exemplified rich medium. 
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Example 13 

Construction of the pSAC35-based expression vectors pDB3175-pDB31 82 for 
co-expression of PDI1 with recombinant human albumin or transferrin 
5 (N413Q, N611Q)from the SnaBI/Notl-site in the 2/jm UL-region 

The Not\ expression cassette from the pSAC35-based expression vector, 
pAYE316 (Sleep el al 5 1991, Biotechnology (N Y), 9, 183-1 87),. designed for the 
secretion of recombinant human albumin, was cloned into the unique Afofl-site of 
10 the E. coli cloning vector pBST(+) (Sleep et al 9 2001, Yeast, 18, 403-421) to 
produce plasmid pQC262e. pQC262e was subsequently modified by site-directed 
mutagenesis (Kunkel et al 9 1987, Methods Enzymol, 154, 367-382) with 
oligonucleotide LRE49 (Table 14) to introduce a unique Asp718I-site immediately 
upstream of the Afo/I-site at the ADH1 terminator region of the expression cassette. 

15 

Table 14: Mutagenic Oligonucleotide 



Primer 


Description 


Sequence 


LRE49 


Mutagenic 45mer 
ADH1 terminator region 


5'-GCTAGCGTCGACAAGCTTGCGG 
CGCGGTACCGTGTGGAAGAACG-3 5 



This produced plasmid pAYE560 (Figure 21). The S. cerevisiae SKQ2n PDI1 
20 gene with the long (~210-bp) promoter was isolated on 1.96-kb Smal fragment 
from pDB2952 (Figure 46 in WO 2005/061719, the contents of which are 
incorporated herein by reference). The pDB2952 PDI1 fragment was cloned into 
the unique Asp! \%\-site of pAYE560, following digestion with AspllXl, filling 
the 3' -recessed ends using T4 DNA polymerase, and calf intestinal alkaline 
25 phosphatase treatment of the blunt-ended product. - This produced plasmids 
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pDB3171 (Figure 22) with the PDIJ gene transcribed in the' same direction as 
rHA, and pDB3172 (Figure 23) with converging transcription of the PDIJ and 
rHA genes. 

5 pDB3175 and pDB3176 (Figures 24 and 25) were produced by cloning the ~ 5.1- 
kb rHA— >PDIJ— > Notl expression cassette from pDB3171 into pSAC35, which 
had been digested with Notl and calf intestinal alkaline phosphatase. pDB3177 
and pDB3178 (Figures 26 and 27) were produced similarly by cloning the -5.1- 
kb Notl tUA^<^PDI1 expression cassette from pDB3172 into pSAC35 digested 
10 with Notl and calf intestinal alkaline phosphatase. 

To construct Notl expression cassettes for co-expression of recombinant 
unglycosylated human transferrin (N413Q, N611Q) and the S. cerevisiae SKQ2n 
PDI1 gene with the long (~210-bp) promoter, the ~ 6.1-kb Aflll-Sphl fragment 

15 from pDB3171 was ligated with the ~ 2.4-kb Aflll-Sphl fragment from pDB2928 
(Figure 1 1 of WO 2005/061 71 8, the contents of which are incorporated herein by 
reference), thus replacing the rHA coding and adjacent sequences with those for 
transferrin (N413Q, N61 1 Q) secretion. This produced plasmids pDB3173 (Figure 
28) with the PDI1 gene transcribed in the same direction as rTf (N41 3Q, N61 1 Q), 

20 and pDB3174 (Figure 29) with converging transcription of the PDI1 and rTf 
(N413Q,N611Q) genes. 

pDB3179 and pDB3180 (Figures 30 and 31) were produced by cloning the ~ 5.2- 
kb rTf-^ PDIJ-* Notl expression cassette from pDB3173 into pSAC35, which 
25 had been digested with Notl and calf intestinal alkaline phosphatase. pDB3181 
and pDB3182 (Figures 32 and 33) were produced by cloning the ~ 5.2-kb Notl 
T Tf-^^.pDll expression cassette from pDB3174 into pSAC35 digested with Notl 
and calf intestinal alkaline phosphatase. 

30 
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Example 14 

S. cerevisiae PDI1 at the SnaBI/Notl-site in the UL-region of pSAC35-based 
expression vectors as a selectable marker in S. cerevisiae strain 
5 DXY1 Atrpl pdil::TRPl 

The yeast strains DXY1 (Kerry-Williams el aL, 1998, Yeast, 14, 161-169) and 
DXY1 Atrpl (see Example 13 of WO 2005/061718, the contents of which are 
incorporated herein by reference) were transformed to leucine prototrophy with 

10 pSAC35-based plasmids pAYE316, pDB3175, pDB3176, pDB3177 5 pDB3178, 
pDB2931, pDB3179, pDB3180, pDB3183 and pDB3182 using a modified lithium 
acetate method (Sigma yeast transformation kit, YEAST- 1, protocol 2; (Ito et al 9 
1983, J BacterioL, 153, 163; Elble, 1992, Biotechniques, 13, 18)). Transformants 
were selected on BMMD-agar plates with appropriate supplements, and were 

1 5 subsequently patched out on BMMD-agar plates with appropriate supplements. 

DXY1 Atrpl [pDB3175], DXY1 Atrpl [pDB3176], DXY1 Atrpl [pDB3177], 
DXY1 Atrpl [pDB3178], DXY1 Atrpl [pDB3179], DXY1 Atrpl [pDB3180], 
DXY1 Atrpl [pDB3181], and DXY1 Atrpl [pDB3182] were transformed to 

20 tryptophan prototrophy using the modified lithium acetate method (Sigma yeast 
transformation kit, YEAST- 1, protocol 2; (Ito et al 9 1983, J. BacterioL, 153, 163; 
Elble, 1992, Biotechniques, 13, 18)) with a 1.41-kb pdil::TRPl disrupting DNA 
fragment isolated from pDB3078 by digestion with NotllPstl. (see Example 13 of 
WO 2005/061718). Transformants were selected on BMMD-agar plates and were 

25 subsequently patched out on BMMD-agar plates. 

Disruption of the PDI1 gene with the TRP1 marker was confirmed by diagnostic 
PCR amplification of an approximately 810-bp product using oligonucleotide 
primers CF247 and DS236 (Table 15). CF247 binds in the PD11 promoter region 
30 upstream of the disruption site and DS236 binds within the TRP1 gene. 
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Table 15 : Oligonucleotide Sequencing Primers 



Primer 


Description 


Sequence 


CF247 


PDI1 promoter region, 
20mer 


5 ' -GCGCGTTTTC ATTAGTGCCC-3 5 


' DS236 


TRP1 coding region, 
i 1 9mer 


5 ' -AAATAGTTC AGGC ACTCCG-3 ' 

-J 



5 DXY1 Atrpl, DXY1 Atrpl containing pDB3175-pDB3182 and putative 
pdilr.TRPl-dismytznts were inoculated into lOmL YEPD in 50mL shake flasks, 
and incubated in an orbital shaker at 30°C, 200rpm for 4-days. Genomic DNA 
was prepared (Lee, 1992, Biotechniques, 12, 677) from the biomass for 
subsequent use as template DNA in diagnostic PCR. 

10 

0,5 uL template genomic DNA, 2.5 uL lOxBuffer (Fast Start Taq+Mg, (Roche)), 
0.5uL lOmM dNTP's, 2.5uL each primer (2uM), 0.2uL Fast Start Taq were mixed 
and made up to 25 uL with H 2 0. PCRs were performed using a Dyad™ DNA 
Engine Peltier thermal cycler (GRI) as follows; denaturation at 95°C for 4 mins, 

15 then 25 cycles of 95°C for 30s, 55°C for 30s and 72°C for 1 min, followed by 
extension at 72°C for 10 mins. A PCR product of ~810-bp was only amplified 
from DNA containing the TRP1 gene integrated at the pdil locus. DXY1 Atrpl 
pdil::TRPl containing pDB3175-pDB3182 were successfully identified. As 
expected, no PCR product was generated for the controls DXY1 Ati-pl or DXY1 

20 Atrpl containing pDB3 1 75-pDB3 1 82. 

DXY1 containing pDB3 1 75-pDB3 1 82 were compared with DXY1 Atipl 

pdil - TRPl containing pDB3175-pDB3182 for plasmid stability and secretion of 
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recombinant human albumin or transferrin (N413Q, N61 1Q) in YEPD shake flask 
culture. lOmL YEPD shake flasks were inoculated with the above strains and 
grown for 4-days at 30°C, 200rpm. Samples were spread onto YEPD-agar plates 
and grown to single colonies. Fifty randomly selected colonies were patched out 

5 in replica onto BMMD and YEPD plates and incubated at 30°C. Plasmid stability 
was scored as the percentage of colonies able to grow on both media. The results 
shown in Tables 16 and 17 demonstrated that all of the plasmids pDB3175- 
pDB3182 were less than 100% stable in DXY1, regardless of the relative 
orientations of the PDI1 and rTf/rHA genes cloned at the SnaEUNofl-site. 

10 However, 100% plasmid stability was determined in all cases in DXY1 Atrpl 
pdil::TRPl containing pDB3175-pDB3182 . Hence, use of PDI1 as the sole 
selectable marker at the SnaBVNotl-siXe of pSAC35-based vectors resulted in 
100% plasmid stability in rich media. 
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Table 16 : Plasmid stability of pSAC35-based vectors containing a recombinant 
albumin gene and the S. cerevisiae PDIJ gene at the SnaBI/Notl-site in the UL- 
region in strains DXY1 and DXY1 Atrpl pdilr.TRPl. lOmL YEPD shake flasks 
were inoculated with DXY1 [pDB3175], DXY1 [pDB3176], DXY1 [pDB3177], 
DXY1 [pDB3178] 9 DXY1 Atrpl pdilr.TRPl [pDB3175], DXY1 Atrpl 
pdilr.TRPl [pDB3176], DX-Y1 Atrpl pdilr.TRPl [pDB3177], and DXY1 Atrpl 
pdilr.TRPl [pDB3178] were grown for 4-days at 30°C, 200rpm. Samples were 
spread onto YEPD-agar plates and grown to single colonies. Fifty randomly 
selected colonies were patched out in replica onto BMMD and YEPD plates and 
incubated at 30°C. Plasmid stability was scored as the percentage of colonies able 
to grow on both media. 



Plasmid 


SnaBl/NotI 
Cassette* 


Plasmid Stabilitj' (% Prototrophs) 


DXY1 


DXY1 Atrpl 


DXY1 Atrpl 
pdilr.TRPl 


pDB3175 


LEU2 rHA PDI1 


96 


0 


100 


pDB3176 


LEU2 PDI1 rHA 
-><-<- 


68 


0 


100 


pDB3177 


LEU2 rHA PDIJ 
-» -> <- 


86 


0 


100 


pDB3178 


LEU2 PDJ1 rHA 
-> <- 


28 


0 


100 



* Arrows indicate the direction of transcription relative to the LEU2 gene. 
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Table 17: Plasmid stability of pSAC35-based vectors containing a recombinant 
transferrin gene and the S. cerevisiae PDI1 gene at the SnaBVNotl-site in the UL- 
region in strains DXY1 and DXY1 Atrpl pdilrTRPl. 10mL YEPD shake flasks 
were inoculated with DXY1 [pDB3179], DXY1 [pDB3180], DXY1 [pDB3181], 
DXY1 [pDB3182], DXY1 Atrpl pdilrTRPl [ P DB3179], DXY1 Atrpl 
pdilr.TKPl [pDB3180] 5 DXY1 Atrpl pdilr.TKPl [pDB3181] : and DXY1 Atrpl 
pdilrTRPl [pDB3182] were grown for 4-days at 30°C, 200rpm. Samples were 
spread onto YEPD-agar plates and grown to single colonies. Fifty randomly 
selected colonies were patched out in replica onto BMMD and YEPD plates and 
incubated at 30°C. Plasmid stability was scored as the percentage of colonies able 
to grow on both media. 



Plasmid 


SnaBl/NotI 
Cassette ! 


Plasmid Stability (% Prototrophs) I 


DXY1 


DXY1 Atrpl 


DXY1 
pdilrTRPl 


pDB3179 


LEU2 rTfPDIl 
-+ 


56 


0 


100 


pDB3180 


LEU2 PDI1 rTf 

-+ <r- <- 


36 


0 


100 | 


pDB3181 


LEU2 rTf PDI1 
-» <- 


28 


0 


100 


pDB3182 


LEU2 PDI1 rTf 
-> -> <- 


15 


0 


100 



* Arrows indicate the direction of transcription relative to the LEU2 gene. 
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DXY1 containing P DB3175-pDB3182 were compared with DXY1 Atrpl 
pdilr.TRPl containing pDB3 1 75-pDB3 1 82 for the secretion of recombinant 
human albumin and transferrin (N413Q, N611Q). Figure 34 shows that following 
disruption of the genomic PDU gene with TRP1 to increase plasmid stability to 
5 1 00% (Table 1 6), there was a reduced recombinant human albumin titre for PDU 
inserted at the SwaBI-site. However, this is more than compensated for by the 
increased genetic stability following disruption of PDU in the genome, which 
increases the reproducibility and reliability of heterologous protein secretion, 
resulting in a more useful industrial organism for application in consistent protein 
1 o production, especially in prolonged cultivation, such as fill and draw fermentation 
or continuous culture fermentation campaigns. Furthermore, when PDU was 
located at the Xcml-site after REP 2 of pSAC35 (Example 12) there was no 
significant decrease in rHA titre following disruption of the genomic PDU gene 
with TRP1 to increase plasmid stability (Table 13). This suggests that the Xcml- 
15 site was preferred (but not essential) compared to the S«oBI-site for PDI1 on 
2um-based plasmids expressing rHA. However, PDU expression can be 
modulated, for example by altering the length of the PDU promoter (Example 7) 
such that an increase in recombinant protein secretion is observed. In the above 
experiment the long PDU promoter was used, which was not the preferred 
20 promoter length for optimal rHA secretion in the closely related high rHA 
producing strain, DS569, where a short promoter resulted in increased rHA 
secretion. In the case of genomic PDU disruption in DXY1 [pDB2977] 
containing a long PDU promoter in PDU at the A«-site after REP 2 of pS AC35 
(Figure 19), the analysis included three individual isolates (not triplicates of a 
25 single isolate as shown in Figure 34), and demonstrated no decrease in rHA titre. 
Furthermore, the increased consistency and reproducibility is clearly demonstrated 
for these cultures following disruption of the genomic PDU gene. 

In Figure 35 a large increase in recombinant transferrin secretion was observed 
30 between DXY1 [pDB2931] without PDU on the pSAC35-based vector (which 
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itself was not 100% stable), and strains containing plasmids pDB3179-pDB3182, 
with PDU at the SnaBVNotl-site. In DXY1 Atrpl pdil::TRPl containing 
plasmids pDB3 1 79-pDB3 1 82, PDI1 on the plasmid was acting as the sole 
selectable marker and resulted in improved genetic stability (see Table 1 7) where 
100% plasmid stability was observed without any decrease in recombinant 
transferrin secretion. Hence, by placing PDI1 on the transferrin expression 
plasmid and disrupting PDI1 in the genome, the overall effect has been to increase 
secretion of the recombinant protein and also to improve the genetic stability of 
the production organism 
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CLAIMS 

1 . A method for producing a desired protein (such as a desired heterologous 
protein) comprising: 

(a) providing a host cell comprising a first recombinant gene encoding 
a protein comprising the sequence of a first chaperone protein, a 
second recombinant gene encoding a protein comprising the 
sequence of a second chaperone protein and a third gene, such a 
third recombinant gene, encoding a desired protein (such as a 
desired heterologous protein), wherein the first and second 
chaperones are different: and 

(b) culturing the host cell in a culture medium to obtain expression of 
the first, second and third genes. 

2. The method of Claim 1 further comprising the step of purifying the thus 
expressed desired protein (such as a desired heterologous protein) from the 
cultured host cell or the culture medium. 

3 . The method of Claim 2 further comprising the step of lyophilising the thus 
purified protein. 

4. The method of Claim 2 or 3 further comprising the step of formulating the 
purified desired protein (such as a desired heterologous protein) with a 
carrier or diluent and optionally presenting the thus formulated protein in a 
unit dosage form. 

5. A method according to any preceding claim wherein one or both of the 
first or second chaperone proteins has a sequence of a fungal chaperone 
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(optionally a yeast chaperone) or a mammalian chaperone (optionally a 
human chaperone). 

A method according to any one of the preceding claims wherein one or 
both of the first and second chaperones each individually comprise the 
sequence of a protein encoded by any one of AHA J, CCT2, CCT3, CCT4, 
CCT5, CCT6, CCT7, CCT8, CNS1, CPR3, CPR6, EPS], EROl, EUG1 9 
FMOh HCH1, HSP10, HSP12, HSP104, HSP26, HSP30, HSP42, HSP60, 
HSP78, HSP82, JEM1, MDJ1, MDJ2, MPD1, MPD2, PDIJ 9 PFD1, ABC1, 
APJh ATPlh ATP12, BTTJ, CDC 3 7, CPR7, HSC82, KAR2, LHS1 9 
MGE1, MRSlh NOBl 9 ECM10, SSAJ, SSA2, SSA3, SSA4, SSC1, SSE2, 
SIL1, SLS1, UBI4, ORMh ORM2, PERI, PTC2, PSEL HAC1 or truncated 
intronless HAC1, TIM9, PAM18 or TCP! or a variant or fragment of any 
one of these. 

A method according to any one of the preceding claims wherein the first 
chaperone is protein disulphide isom erase. 

A method according to any one of the preceding claims wherein the 
second chaperone is Orm2p or a variant or fragment thereof. 

A method according to any one of the preceding claims wherein at least 
one, preferably both, of the first or second chaperones is encoded by a 
chromosomally integrated recombinant gene. 

A method according to any one of the preceding claims wherein at least 
one, preferably both, of the first or second chaperones is encoded by a 
gene on a plasmid. 

A method according to any preceding claim wherein the third gene which 

encodes the desired protein (such as a desired heterologous protein) is 
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integrated in the chromosome of the host cell, or is provided on a plasmid 
within the host cell. 

A method according to Claim 10 or 1 1 wherein the plasmid is, or is not, a 
2um-family plasmid. 

A method according to Claim 12 wherein the plasmid comprises a gene 
encoding a protein comprising the sequence of the first chaperone protein 
and/or a gene encoding a protein comprising the sequence of the second 
chaperone protein, and a gene encoding a desired heterologous protein. 

A method according to Claim 12 or 13 wherein the plasmid is a 
disintegration vector. 

A method according to any preceding claim wherein the desired protein 
(such as a desired heterologous protein) comprises a leader sequence 
effective to cause secretion from the host cell, such as yeast. 

A method according to any preceding claim wherein the desired protein 
(such as a desired heterologous protein) is a eucaryotic protein, or a 
fragment or variant thereof, optionally a vertebrate or a fungal (such as a 
yeast) protein. 

A method according to any preceding claim wherein the desired protein 
(such as a desired heterologous protein) is a commercially useful protein, 
such as a therapeutically, diagnostically, industrially, domestically or 
nutritionally useful protein. 

A method according to any preceding claim wherein the desired protein 
(such as a desired heterologous protein) comprises a sequence selected 
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from albumin, a monoclonal antibody, an etoposide, a serum protein (such 
as a blood clotting factor, e.g. Factor VII, Factor VIII, Factor IX, Factor X 
and Factor XIII), antistasin, a tick anticoagulant peptide, transferrin, 
lactoferrin, endostatin, angiostatin, collagens, immunoglobulins, or 
immunoglobulin-based molecules or fragment of either (e.g. a dAb, Fab 5 
fragments, F(ab') 2 , scAb, scFv or scFv fragment), a Kunitz domain 
protein, interferons, interleukins, IL10, IL11, IL2, interferon a species and 
sub-species, interferon p species and sub-species, interferon y species and 
sub-species, leptin, CNTF, CNTF Axl5 \Axokine™), ILl-receptor 
antagonist, erythropoietin (EPO) and EPO mimics, thrombopoietin (TPO) 
and TPO mimics, prosaptide, cyanovirin-N, 5-helix, T20 peptide, T1249 
peptide, HIV gp41, HIV gpl20, urokinase, prourokinase, tPA, hirudin, 
platelet derived growth factor, parathyroid hormone, proinsulin, insulin, 
glucagon, glucagon-like peptides, insulin-like growth factor, calcitonin, 
growth hormone, transforming growth factor p, tumour necrosis factor, G- 
CSF. GM-CSF, M-CSF, FGF, coagulation factors in both pre and active 
forms, including but not limited to plasminogen, fibrinogen, thrombin, pre- 
thrombin, pro-thrombin, von Willebrand's factor, a i -antitrypsin, 
plasminogen activators, nerve growth factor, LAGI, platelet-derived 
endothelial cell growth factor (PD-ECGF), glucose oxidase, serum 
cholinesterase, aprotinin, amyloid precursor protein, inter-alpha trypsin 
inhibitor, antithrombin III, apo-lipoprotein species, Protein C, Protein S, a 
metabolite, an antibiotic, or a variant or fragment of any of the above. 

A method according to any preceding claim wherein the desired protein 
(such as a desired heterologous protein) comprises the sequence of 
albumin or a variant or fragment thereof. 

A method according to any preceding claim wherein the desired protein 
(such as a desired heterologous protein) comprises the sequence of a 
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transferrin family member, optionally transferrin or lactoferrin, or a variant 
or fragment thereof 

A method according to any preceding claim wherein the desired protein is 
a desired heterologous protein that comprises a fusion protein, such as a 
fusion protein of albumin or a transferrin family member or a variant or 
fragment of either, fused directly or indirectly to the sequence of another 
protein. 

A plasmid comprising a first recombinant gene encoding a protein 
comprising the sequence of a first chaperone protein and a second 
recombinant gene encoding a protein comprising the sequence of a second 
chaperone protein. 

A plasmid according to Claim 22 wherein at least one, and optionally both, 
of the first and second chaperones are chaperones as defined by any one of 
Claims 5 to 8. 

A plasmid according to Claim 22 or 23 further comprising a third 
recombinant gene, which third recombinant gene encodes a desired 
heterologous protein, optionally a desired heterologous protein as 
described by any one of Claims 15-21 . 

A plasmid according to any one of Claims 22 to 24 wherein the plasmid is 
a plasmid as defined by any one of Claims 12 to 14. 

A host cell comprising a first recombinant gene encoding a protein 
comprising the sequence of a first chaperone protein, a second 
recombinant gene encoding a protein comprising the sequence of a second 
chaperone protein and a third gene, such as a third recombinant gene, 
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encoding a desired protein (such as a desired heterologous protein), 
wherein the first and second chaperones are different. 

A host cell according to Claim 26, which host cell is a host cell as defined 
by any one of Claims 5 to 2 1 . 

A host cell comprising a plasmid as defined by any one of Claims 22-25. 

A method according to any one of Claims 1 to 21 or a host cell according 
to any one of Claims 26 to 28 wherein the host cell is a bacterial or yeast 
host cell. 

A method or a host cell according to Claim 29 wherein the host cell is a 
yeast cell, optionally a member of the Saccharomyces, Kluyver omyces, 
Arxula, Yarrowia, Candida, Schizosaccharomyces, Debaryomyces, 
Xanthophyllomyces, Geotrichum, Ashbya, Hortaea, Schwanniomyces, 
Trichosporon, Xanthophyllomyces, or Pichia genus, such as 
Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, Pichia 
membranaefaciens, Zygosaccharomyces rouxii, Zygosaccharomyces bailii, 
Zygosaccharomyces fermentatU Kluyveromyces drosphilarum, Pichia 
methanolica, Hansenula polymorpha (also known as Pichia augustd), 
Arxula adeninivorans, Yarrowia lipolytica, Candida boidinii Candida 
utilis or Schizosaccharomyces pombe. 

A host cell according to Claim 27, or as defined by Claim 10 or 11, in 
which the plasmid is a 2 jam-family plasmid and wherein: 

(a) the plasmid is based on pSRl, pSB3 or pSB4 and the host cell is 
Zygosaccharomyces rouxii; 

(b) the plasmid is based on pSBl or pSB2 and the host cell is 
Zygosaccharomyces bailli; 
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(c) the plasmid is based on pSMl and the host cell is 
Zygosaccharomyces ferm entati; 

(d) the plasmid is based on pKDl and the host cell is Kluyveromyces 
drosophilarum; 

(e) the plasmid is based on pPMl and the host cell is Pichia 
membranaefaciens; or 

(f) the plasmid is based on the 2\xm plasmid and the host cell is 
Saccharomyces cerevisiae or Saccharomyces carlsbergensis. 

32. A host cell according to Claim 31 in which the plasmid is based on the 
2 jam plasmid and the host cell is Saccharomyces cerevisiae or 
Saccharomyces carlsbergensis. 

33. A method for producing desired protein (such as a desired heterologous 
protein) comprising: 

(a) providing a host cell comprising a first recombinant gene encoding 
a protein comprising the sequence of Orm2p or a variant or 
fragment thereof and a second gene, such as a second recombinant 
gene, encoding a desired protein (such as a desired heterologous 
protein), with the proviso that the first and second genes are not 
present within the host cell on the same 2j_im-family plasmid; and 

(b) culturing the host cell in a culture medium to obtain expression of 
the first and second genes. 

34. A method according to Claim 33 wherein either or both of the first and 
second genes are expressed from a plasmid, and optionally from the same 
plasmid. 
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A method according to Claim 33 or 34 wherein the first recombinant gene 
encoding a protein comprising the sequence of Orm2p or a variant or 
fragment thereof is located on a plasmid. 

A method according to Claim 33 wherein either or both of the first and 
second genes are integrated into the chromosome of the host cell. 

A method according to Claim 33 or 36 wherein the first recombinant gene 
encoding a protein comprising the sequence of Orm2p or a variant or 
fragment thereof is integrated into the chromosome of the host cell. 

A method according to any one of Claims 33 to 37 further comprising the 
step of purifying the thus expressed desired protein (such as a desired 
heterologous protein) from the cultured host cell or the culture medium. 

A method according to Claim 38 further comprising the step of 
formulating the purified desired protein (such as a desired heterologous 
protein) with a carrier or diluent and optionally presenting the thus 
formulated protein in a unit dosage form. 

A host cell comprising a first recombinant gene encoding a protein 
comprising the sequence of Orm2p or a variant or fragment thereof and a 
second gene, such as a second recombinant gene, encoding a desired 
protein (such as a desired heterologous protein), with the proviso that the 
first and second recombinant genes are not present within the host cell on 
the same 2jam-family plasmid. 

Use of a recombinant gene encoding a protein comprising the sequence of 
Orm2p or a variant or fragment thereof to increase the expression of a 
desired protein (such as a desired heterologous protein) in a host cell, with 
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the proviso that the recombinant gene encoding a protein comprising the 
sequence of Orm2p or a variant or fragment thereof and the gene encoding 
the desired protein (such as a desired heterologous protein) are not co- 
expressed within the host cell from the same 2|Lim-family plasmid. 

A plasmid comprising a first recombinant gene encoding a protein 
comprising the sequence of Orm2p or a variant or fragment thereof and a 
second gene, such as a second recombinant gene, encoding a desired 
protein (such as a desired heterologous protein), with the proviso that the 
plasmid is not a 2jam-family plasmid. 

A method, use, host cell or plasmid according to any one of Claims 33 to 
42 wherein the desired protein (such as a desired heterologous protein) is 
as defined in any one of Claims 1 5 to 2 1 . 

A host cell comprising a plasmid, the plasmid comprising a gene that 
encodes an essential chaperone wherein, in the absence of the plasmid, the 
host cell is unable to produce the chaperone, the plasmid further 
comprising a recombinant gene encoding a desired heterologous protein, 
such as a desired heterologous protein as defined in any one of Claims 1 5 
to 21. 

A host cell according to Claim 44 wherein the essential chaperone is a 
eucaryotic chaperone. 

A host cell according to Claim 44 wherein the essential chaperone is a 
yeast chaperone. 

A host cell according to Claim 44 wherein the essential chaperone is a 
yeast chaperone comprises the sequence of a protein encoded by a gene 
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selected from CCT2, CCT3, CCT4, CCT5. CCT6, CCT7, CCT8. CMS], 
EROl, HSP10, HSP60, PDI1 , CDC37, KAR2, MGE1, MRS] 1. NOB J, 
SSC1, PSE1, TIM9, PAM18 and TCP1 or a variant or fragment of any one 
of these.. 

5 

48. A host cell according to Claim 44 wherein the essential chaperone is 
protein disulphide isomerase. 

49. A host cell according to any one of Claims 44 to 48 wherein, in the 
10 absence of the plasmid, the host cell is inviable. 

50. A host cell according to Claim 49 wherein, in the absence of the plasmid, 
the host cell is inviable and cannot be made viable by growing the host cell 
in a growth medium and making nutrient additions or modifications to that 

15 growth medium, and preferably cannot be made viable by growing the host 

cell in a growth medium and making any additions or modifications to that 
growth medium. 

51. A plasmid comprising, as the sole yeast selectable marker, a gene encoding 
20 a chaperone that is essential to the viability of a yeast host cell, in the sense 

that when the gene or genes encoding the essential chaperone are deleted 
or inactivated in a yeast host cell, then the host cell is inviable in culture 
and the deficiency cannot be complemented by additions or modifications 
to the culture medium. 

25 

52. A plasmid according to Claim 51 wherein the gene encoding the essential 
chaperone is the sole selectable marker encoded by the plasmid. 

53. A plasmid according to Claim 52 wherein the essential chaperone is a 
30 eucatyotic chaperone. 
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54. A plasmid according to Claim 52 wherein the essential chaperone is a 
yeast chaperone. 

55. A plasmid according to Claim 52 wherein the essential chaperone is a 
yeast chaperone comprises the sequence of a protein encoded by a gene 
selected from CCT2, CCT3, CCT4, CCT5, CCT6, CCT7. CCT8, CNS1, 
EROl, HSP10, HSP60, PDI1, CDC 3 7. KAR2, MGE1, MRSU, NOB1, 
SSCJ, PSE1, TIM9, PAM18 and TCP] or a variant or fragment of any one 
of these. 

56. A plasmid according to Claim 52, wherein the essential chaperone is 
protein disulphide isomerase or Pselp. 

57. A plasmid according to any one of Claims 51 to 56 further comprising a 
gene encoding a desired heterologous protein, such as a desired 
heterologous protein defined by any one of Claims 15 to 21. 

58. A plasmid according to any one of Claims 51 to 57 which is a 2|am-family 
plasmid. 

59. A plasmid according to any one of Claims 51 to 58 which is a plasmid as 
defined by any one of Claims 22 to 25. 

60. A host cell according to any one of Claims 44 to 50 in which the plasmid 
is a plasmid according to any one of Claims 51 to 59. 

61. A host cell according to Claim 28 wherein a chaperone encoded by the 
plasmid is an essential gene. 
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62. A host cell according to Claim 60 wherein, in the absence of the plasmid, 
the host cell does not produce the chaperone. 

63. A method according to Claim 1 wherein the host cell is a host cell as 
5 defined by Claim 61 or 62. 

64. A method according to Claim 63 wherein the step (b) of Claim 1 involves 
culturing the host cell in non-selective media, such as rich or complex 
media. 

10 

65. A method for producing a desired recombinant protein comprising the 
steps of: providing a host cell comprising a plasmid, the plasmid 
comprising a first recombinant gene that encodes a chaperone that is 
essential to the viability of the host cell (the "essential chaperone") and 

15 wherein, in the absence of the plasmid, the host cell is unable to produce 

the essential chaperone, which host cell is optionally a host cell as defined 
by any one of Claims 44 to 50 or 60 to 62, and further in which the host 
cell comprises a gene that encodes a desired protein: culturing the host 
cell in a culture medium under conditions that allow the expression of the 

20 essential chaperone and the desired protein; and optionally isolating the 

thus expressed desired protein from the cultured host cell or the culture 
medium; and optionally purifying the thus isolated desired protein to a 
commercially acceptable level of purity; and further optionally, 
lyophilising the thus purified protein or formulating the purified desired 

25 protein with a carrier or diluent (such as a pharmaceutically acceptable 

carrier or diluent); and optionally presenting the thus formulated desired 
protein in a unit dosage form. 

The method of Claim 65 wherein the step of culturing the host cell 
involves culturing the host cell in a non-selective media, such as a complex 
or rich media. 
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67. A method. for producing a desired protein (such as a desired heterologous 
protein) comprising the steps of: 

(a) providing a host cell as defined by any one of Claims 44 to 50; and 

(b) culturing the host cell in a culture medium under conditions that 
allow the expression of the essential chaperone and the desired 
protein. 

68. The method of Claim 67 wherein the host cell comprises a plasmid as 
defined by any one of Claims 51 to 59. 



69. The method of Claim 67 or 68 wherein step (b) of the method of Claim 67 
15 is performed by culturing the host cell in a non-selective medium, such as 

a rich or complex medium. 

70. Use of a polynucleotide comprising a sequence that encodes a chaperone 
that is essential to the viability of a host cell (the "essential chaperone"), to 

20 increase the stability of a plasmid in the host cell by integration of the 

polynucleotide into the plasmid to produce a modified plasmid, wherein 
the host cell is unable to produce the essential chaperone in the absence of 
the modified plasmid. 

25 71 . Use according to Claim 70 to increase the stability of a plasmid in the host 
cell when the host cell is grown under non-selective conditions, such as in 
a rich or complex medium. 

72. Use according to Claim 70 or 71 wherein the plasmid is a 2|am-family 
30 plasmid. 
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73. Use according to any one of Claims 70 to 72 wherein the plasmid further 
comprises a gene encoding a desired heterologous protein, such as a 
desired heterologous protein as defined in any one of Claims 15 to 21 . 

74. Use according to any one of Claims 70 to 73 wherein the essential 
chaperone is a eucaryotic chaperone. 

75. Use according to any one of Claims 70 to 73 wherein the essential 
chaperone is a yeast chaperone. 

76. Use according to any one of Claims 70 to 73 wherein the essential 
chaperone is a yeast chaperone comprises the sequence of a protein 
encoded by a gene selected from CCT2, CCT3, CCT4, CCT5, CCT6, 
CCT7, CCT8, CNS1, EROl, HSP10, HSP60, PDI1, CDC3 7, KAR2, 
MGE1, KdRSl 7, NOB1, SSC1, PSE1, TIM9, PAM18 and TCP1 or a variant 
or fragment of any one of these. 

77. Use according to any one of Claims 70 to 73 wherein the essential protein 
is a chaperone, such as protein disulphide isomerase or Pselp. 

78. Use according to any one of Claims 70 to 77 to simultaneously increase 
the stability of a plasmid in the host cell and increase the ability of the host 
cell to produce protein product. 

79. Use according to Claim 78 wherein the protein product is an endogenously 
encoded protein or a heterologous protein, such as a heterologous protein 
as defined in any one of Claims 15 to 21. 
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Use according to Claim 79 wherein the protein product is a heterologous 
protein that is encoded by a recombinant gene that has been integrated into 
the chromosome of the host cell. 

Use according to Claim 79 wherein the protein product is a heterologous 
protein that is encoded by a recombinant gene that is present on a plasmid 
in the host cell. 

Use according to Claim 81 wherein the plasmid that comprises the 
recombinant gene that encodes the heterologous protein is the same 
plasmid as the modified plasmid that comprises the polynucleotide that 
encodes the essential chaperone. 

A host cell, plasmid, method or use according to any one of Claims 44 to 
82 wherein the host cell is a bacterial or yeast host cell. 

A host cell, plasmid, method or use according to Claim 83 wherein the 
host cell is a yeast cell, optionally a member of the Saccharomyces, 
Kluyveromyces, Arxula, Yarrowia, Candida, Schizosaccharomyces, 
Debaryomyces, Xanthophyllomyces, Geotrichum, Ashbya, Hortaea, 
Schwanniomyces, Trichosporon, Xanthophyllomyces, or Pichia genus, 
such as Saccharomyces cerevisiae, Kluyveromyces lactis, Pichia pastoris, 
Pichia membranaefaciens, Zygosaccharomyces rouxii, 

Zygosaccharomyces bailiU Zygosaccharomyces fermentati, Kluyveromyces 
drosphilarum, Pichia methanolica, Hansenula polymorpha (also known as 
Pichia augusta), Arxula adeninivorans, YarroMna lipolytica, Candida 
boidinii Candida utilis or Schizosaccharomyces pombe. 

A host cell, plasmid, method or use according to Claim 84, in which the 
plasmid is a 2|^m-family plasmid and wherein: 
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(a) the plasmid is based on pSRL pSB3 or pSB4 and the host cell is 
Zygosaccharomyces rouxii; 

(b) the plasmid is based on pSBl or pSB2 and the host cell is 
Zygosaccharomyces bailli; 

(c) the plasmid is based on pSMl ' and the host cell is 
Zygosaccharomyces fermentati; 

(d) the plasmid is based on pKDl and the host cell is Kluyveromyces 
drosophilarum; 

(e) the plasmid is based on pPMl and the host cell is Pichia 
membranaefaciens; or 

(f) the plasmid is based on the 2\xm plasmid and the host cell is 
Saccharomyces cerevisiae or Saccharomyces carlsbergensis. 

A host cell, plasmid, method or use according to Claim 85 in which the 
plasmid is based on the 2|am plasmid and the host cell is Saccharomyces 
cerevisiae or Saccharomyces carlsbergensis. 

Use of a polynucleotide comprising the sequence of a promoter operably 
connected to a coding sequence encoding a chaperone for increasing the 
expression of a desired protein (such as a desired heterologous protein) in 
a host cell by expression of the polynucleotide sequence within the host 
cell, wherein the promoter is characterised in that it achieves a lower level 
of expression of the chaperone than would be achieved if the coding 
sequence were to be operably connected to its naturally occurring 
promoter, optionally wherein the use is a use as defined in any one of 
Claims 70 to 82. 

A method for producing a desired protein (such as a desired heterologous 
protein) comprising the steps of: 
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(a) providing a host cell comprising a recombinant gene that 
comprising the sequence of promoter operably connected to a 
coding sequence encoding a chaperone, the promoter being 
characterised in that it achieves a lower level of expression of the 
chaperone than would be achieved if the coding sequence were to 
be operably connected to its naturally occurring promoter, and the 
host cell further comprising a gene, such as a recombinant gene, 
encoding a desired protein (such as a desired heterologous protein); 
and 

(b) culturing the host cell in a under conditions that allow the 
expression of the chaperone and the desired protein (such as a 
desired heterologous protein); 

optionally wherein the method is a method according to any preceding 
method claim. 

89. The method of Claim 88 further comprising the step of purifying the 
desired protein (such as a desired heterologous protein) produced in step 
(b); and optionally lyophilising the thus purified protein. 

90. The method of Claim 89 further comprising the step of formulating the 
purified or lyophilised desired protein (such as a desired heterologous 
protein) with a carrier or diluent. 

91 . The method of Claim 90 further comprising the step of presenting the thus 
formulated protein in a unit dosage form. 
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